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DNA SEQUENCES, RECOMBINANT DNA MOLECULES 
AND PROCESSES FOR PRODUCING SOLUBLE 

T4 PROTEINS 



5 TECHNICAL FIELD OF INVENTION 

This invention relates to DNA sequences, 
recombinant DNA molecules and processes for producing 
soluble T4 proteins. More particularly, this invention 
relates to DNA sequences that are characterized, in 

10 that they code on expression in an appropriate uni- 
cellular host for soluble forms of T4, the receptor 
on the surface of T4 + lymphocytes, or derivatives 
thereof. In accordance with this invention, the DNA 
sequences, recombinant DNA molecules and processes 

15 of this invention may be employed to produce soluble 

T4 essentially free of other proteins of human origin. 
This soluble protein may then advantageously be used 
in the immunotherapeutic, prophylactic, and diag- 
nostic compositions and methods of this invention. 

20 The soluble T4 protein-based immunothera- 

peutic compositions and methods of this invention 
are useful in treating immunodeficient patients suf- 
fering from diseases caused by infective agents whose 
primary targets are T4 + lymphocytes. According to a 

25 preferred embodiment, this invention relates to solu- 
ble T4 protein-based compositions and methods which 
are useful in preventing, treating or detecting 
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acquired immune deficiency syndrome, AIDS related 
complex and HIV infection. 

BACKGROUND ART 

The class of immune regulatory cells known 
5 as T cell lymphocytes can be divided into two broad 
functional classes, the first class comprising T 
helper or inducer cells — which mediate T cell pro- 
liferation, lymphokine release and helper cell inter- 
actions for Ig release, and the second class compris- 

10 ing T cytotoxic or suppressor cells — which parti- 
cipate in T cell-mediated killing and immune response 
suppression. In general, these two classes of 
lymphocytes are distinguished by expression of one 
of two surface glycoproteins: T4 (m.w. 55,000-62,000 

15 daltons) which is expressed on T helper or inducer 
cells, probably as a raonomeric protein, or T8 (m.w. 
32,000 daltons) which is expressed on T cytotoxic or 
suppressor cells as a dimeric protein. 

The primary structures of T4 and T8 have 

20 been deduced from their respective cDNA sequences 

[P. J. Maddon et al., "The Isolation and Nucleotide 
Sequence Of A cDNA Encoding The T Cell Surface Protein 
T4: A New Member Of The Immunoglobulin Gene Family", 
Cell , 42, pp. 93-104 (1985); D. R. Littman et al., 

25 "The Isolation And Sequence Of The Gene Encoding T8 : 
A Molecule Defining Functional Classes Of T Lympho- 
cytes", Cell , 40, pp. 237-46 (1985)]. Both predicted 
protein sequences define molecules with domains 
expected for surface antigens, including transmem- 

30 brane and intracytoplasmic domains at the carboxyl 

end of the protein. In addition, both proteins con- 
tain an amino terminal region which shows striking 
homology to immunoglobulin and T cell receptor 
variable regions and which might function during 

35 target cell recognition [ Maddon et al . , supra ] . 
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In immunocompetent individuals, T4 lympho- 
cytes interact with other specialized cell types of 
the immune system to confer immunity to or defense 
against infection [E. L. Reinherz and S. F. 
5 Schlossman, "The Differentiation Function Of Human 
T-Cells", Cell , 19, pp. 821-27 (1980)]. More speci- 
fically, T4 lymphocytes stimulate production of growth 
factors which are critical to a functional immune 
system. For example, they act to stimulate B cells, 

10 the descendants of hemopoietic stem cells, which 
promote the production of defensive antibodies. 
They also activate macrophages ("killer cells") to 
attack infected or otherwise abnormal host cells and 
they induce monocytes ("scavenger cells") to encompass 

15 and destroy invading microbes. 

It has been found that the primary target 
of or receptor for certain infective agents is the 
T4 surface protein. These agents include, for 
example, viruses and retroviruses. When T4 lympho- 

20 cytes are exposed to such agents, they are rendered 

nonfunctional. As a result, the host's complex immune 
defense system is destroyed and the host becomes 
susceptible to a wide range of opportunistic infec- 
tions . 

25 Such immunosuppression is seen in patients 

suffering from acquired immune deficiency syndrome 
("AIDS"). AIDS is a disease characterized by severe 
or, typically, complete immunosuppression and 
attendant host susceptibility to a wide range of 

30 opportunistic infections and malignancies. In some 
cases, AIDS infection is accompanied by central 
nervous system disorders. Complete clinical mani- 
festation of AIDS is usually preceded by AIDS 
related complex ("ARC"), a syndrome accompanied by 

35 symptoms such as persistent generalized lymphadeno- 
pathy, fever and weight loss. The human immunode- 
ficiency virus ("HIV") retrovirus is thought to be 
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the etiological agent responsible for AIDS infection- 
and its precursor, ARC [M. G. Samgadharan et al., 
"Detection, Isolation And Continuous Production Of 
Cytopathic Retroviruses (HTLV-III) From Patients 
5 With AIDS And Pre-AIDS", Science , 224, pp. 497-508 
(1984)] .* 

Between 85 and 100% of the AIDS/ARCS popu- 
lation test seropositive for HIV [G. N. Shaw et al., 
"Molecular Characterization Of Human T-Cell Leukemia 

10 (Lymphotropic) Virus Type III In The Acquired Immune 
Deficiency Syndrome", Science , 226, pp. 1165-70 
(1984)]. The number of adults in the United States 
infected with HIV has been estimated to be between 1 
and 2.5 million [D. Barnes, "Strategies For An AIDS 

15 Vaccine", Science , 233, pp. 1149-53 (1986); M. Rees, 
"The Sombre View Of AIDS", Nature , 326, pp. 343-45 
(1987)]. These estimates include 64,900 individuals 
who do not belong to an identified group at risk for 
AIDS [S. L. Sivak and G.'P. Wormser, "How Common Is 

20 HTLV-III Infection In The United States?", New Eng. 
J. Med. , 313, p. 1352 (1985)]. The apparent annual 
rate of diagnosis for those infected with HIV virus 
is between 1 and 2% — a rate which may increase 
significantly in future years. 

25 The genome of retroviruses, such as HIV, 

contains three regions encoding structural proteins. 
The gag region encodes the core proteins of the 
virion. The pol region encodes the virion RNA-depen- 
dent DNA polymerase (reverse transcriptase). The 

30 

* In this application, human immunodeficiency 
virus ( "HIV" ) , the generic term adopted by the human 
retrovirus subcommittee of the International Committee 
On Taxonomy Of Viruses to refer to independent iso- 
35 lates from AIDS patients, including human T cell 
lymphotropic virus type III ( "HTLV- III"), lympha- 
denop a thy- associated virus ("LAV"), human immuno- 
deficiency virus type 1 ("HIV-1") and AIDS-associated 
retrovirus ( "ARV" ) will be used. 
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env region encodes the major glycoprotein found in 
the membrane envelope of the virus and in the cyto- 
plasmic membrane of infected cells. The capacity of 
the virus to attach to target cell receptors and to 
t 5 cause fusion of cell membranes are two HIV proper- 

ties controlled by the env gene. These properties 
j are believed to play a fundamental role in the patho- 

genesis of the virus. 

HIV env proteins arise from a precursor 
10 polypeptide that, in mature form, is cleaved into a 
large heavily glycosylated exterior membrane protein 
of about 481 amino acids — gpl2 0 — and a smaller 
transmembrane protein of about 345 amino acids which 
may be glycosylated — gp41 [L. Ratner et al., 
15 "Complete Nucleotide Sequence Of The AIDS Virus, 
HTLV-III", Nature , 313, pp. 277-84 (1985)]. 

The host range of the HIV virus is asso- 
ciated with cells which bear the surface glycopro- 
tein T4. Such cells include T4 lymphocytes and 
20 brain cells [P. J- Maddon et al . , "The T4 Gene Encodes 
The AIDS Virus Receptor And Is Expressed In The 
Immune System And The Brain", Cell , 47, pp. 333-48 
(1986)]. Upon infection of a host by HIV virus, the 
T4 lymphocytes axe rendered non- functional . The 
25 progression of AIDS/ARCS syndromes can be correlated 
with the depletion of T4 + lymphocytes, which display 
the T4 surface glycoprotein. This T cell depletion, 
with ensuing immunological compromise, may be attri- 
butable to both recurrent cycles of infection and 
30 lytic growth from cell-mediated spread of the virus. 
» In addition, clinical observations suggest that the 

HIV virus is directly responsible for the central 
* nervous system disorders seen in many AIDS patients. 

The tropism of the HIV virus for T4 + cells 
3 5 is believed to be attributed to the role of the T4 
cell surface glycoprotein as the membrane -anchored 
virus receptor. Because T4 behaves as the HIV virus 
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receptor, its extracellular sequence probably plays 
a direct role in binding HIV . More specifically, it 
is believed that HIV envelope selectively binds to 
the T4 epitope(s), using this interaction to initiate 

5 entry into the host cell [A. G. Dalgelish et al., 
"The CD4 (T4) Antigen Is An Essential Component Of 
The Receptor For The AIDS Retrovirus", Nature , 312, 
pp. 763-67 (1984); D. Klatzmann et al., "T- Lymphocyte 
T4 Molecule Behaves As The Receptor For Human Retro- 

10 virus LAV" , Nature, 312, pp. 767-68 (1984)]. Accord- 
ingly, cellular expression of T4 is believed to be 
sufficient for HIV binding, with the T4 protein 
serving as a receptor for the HIV virus. 

The T4 tropism of the HIV virus has been 

15 demonstrated in vitro . When HIV virus isolated from 
AIDS patients is cultured together with T helper 
lymphocytes preselected for surface T4, the lympho- 
cytes are efficiently infected, display cytopathic 
effects, including multinuclear syncytia formation 

20 and are killed by lytic growth [D. Klatzmann et al., 
"Selective Tropism Of Lymphadenopathy Associated 
Virus (LAV) For Helper- Inducer T Lymphocytes", 
Science , 225, pp. 59-63 (1984); F. Wong-Staal and 
R. C. Gallo, "Human T-Lymphotropic Retroviruses", 

25 Nature , 317, pp. 395-403 (1985)]. It has been demon- 
strated that a cloned cDNA version of human T4, when 
expressed on the surface of trans fected cells from 
non-T cell lineages, including murine and fibroblas- 
toid cells, endows those cells with the ability to 

30 bind HIV [P. J. Maddon et al., "The T4 Gene Encodes 

The AIDS Virus Receptor And Is Expressed In The Immune 
System And The Brain", Cell , 47, pp. 333-48 (1986)]. 

During the course of HIV infection, the 
host mounts both a humoral and a cellular immune 

35 response to the virus. These responses include the 
appearance of antibodies which bind to a number of 
viral products and which exhibit neutralizing effect 
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or antibody dependent cellular cytotoxic functions 
[M. Guroff-Robert et al., "HTLV-I I I-Neutralizing 
Antibodies In Patients With AIDS And AIDS-Related 
Complex", Nature, 316, pp. 72-74 (1985),- D . D. F. 
5 Barin et al., "Virus Envelope Protein Of HTLV-I II 
Represents Major Target Antigen For Antibodies In 
AIDS Patients", Science , 228, pp. 1094-96 (1985); 
A. H. Rook et al . , "Sera From HTLV-III/LAV Antibody 
Positive Individuals Mediate Antibody Dependent 

10 Cellular Cytotoxicity Against HTLV-III/LAV Infected 
T Cells", J. Immunol. , 138, pp. 1064-68 (1987)]. 
Epitopes of the HIV envelope have been identified as 
important determinants in eliciting a neutralizing 
antibody response. And, determinants in antibody 

15 dependent cellular cytotoxicity ("ADCC") activity 
include HIV eny and, possibly, gag epitopes. 

In the absence to date of effective treat- 
ments for AIDS, many efforts have centered on preven- 
tion of the disease. Such preventative measures 

20 include HIV antibody screening for all blood, organ 
and semen donors and education of AIDS high-risk 
groups regarding transmission of the disease. 

Experimental or early-stage clinical treat- 
ment of AIDS and ARCS conditions have included the 

25 administration of antiviral drugs, such as HPA-23, 

phosphono formate, suramin, ribavirin, azidothymidine 
( "AZT" ) and dideoxycytidine, which apparently inter- 
fere with replication of the virus through reverse 
transcriptase inhibition. Although each of these 

30 drugs exhibits activity against HIV in vitro , only 
AZT has demonstrated potential benefits in clinical 
trials. AZT administration in effective amounts, 
however, has been accompanied by undesirable and 
debilitating side effects, such as bone marrow 

35 depression. It is likely, therefore, that hemato- 
logic toxicity will be a major rate limiting factor 
in the long term use of AZT. 
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Other proposed methods for treating AIDS 
have focused on the development of agents having 
activity against steps in the viral replicative cycle 
other than reverse transcription. Such methods 
5 include the administration of interferons or the 

application of hybridoma technology- Most of these 
treatment strategies are expected to require the 
co-administration of immunomodulators , such as inter- 
leukin-2 . 

10 To date, the need exists for the develop- 

ment of effective immunotherapeutic agents and methods 
for the treatment of AIDS, ARCS, HIV infection and 
other immunodeficiencies caused by T lymphocyte 
depletion or abnormalities. 

15 DISCLOSURE OF THE INVENTION 

The present invention solves the problems 
referred to above by providing, in large amounts, 
soluble T4 and soluble derivatives thereof that act 
as receptors for infective agents whose primary target 

20 is the T4 surface protein of T4 + lymphocytes- Advan- 
tageously, this invention also provides soluble T4 
essentially free of other proteins of human origin 
and in a form that is not contaminated by viruses, 
such as HIV or hepatitis B virus. 

25 As will be appreciated from the disclosure 

to follow, the DNA sequences and recombinant DNA 
molecules of this invention are capable of directing, 
in an appropriate host, the production of soluble T4 
or derivatives thereof. The polypeptides of this 

30 invention are useful, either as produced in the host 
or after further derivatization or modification, in 
a variety of immunotherapeutic compositions and 
methods for treating immunodeficient patients 
suffering from diseases caused by infective agents 

35 whose primary targets are T4 + lymphocytes. According 
to various embodiments of this invention, such cornpo- 



WO 89/01940 



PCT/US88/02940 



-9- 



10 



sitions and methods relate to a soluble receptor for 
HIV, soluble T4 proteins and polypeptides and anti- 
bodies thereto. The soluble T4 proteins and polypep- 
tides of this invention include monovalent, as well 
as polyvalent forms . 

The compositions and methods of this inven- 
tion, which axe based upon soluble T4 proteins, poly- 
peptides or peptides and antibodies thereto, are 
particularly useful for the prevention, treatment or 
detection of the HIV-related infections AIDS and 
ARC. More specifically, the soluble T4-based com- 
positions and methods of this invention employ 
soluble T4-like polypeptides — polypeptides which 
advantageously interfere with the T4/HIV interaction 
15 by blocking or competitive binding mechanisms which 
inhibit HIV infection of cells expressing the T4 
surface protein. These soluble T4-like polypeptides 
inhibit adhesion between T4 + lymphocytes and infec- 
tive agents which target T4 + lymphocytes and inhibit 
20 interaction between T4 + lymphocytes and antigen pre- 
senting cells and targets of T4 + lymphocytes mediated 
killing. By acting as soluble virus receptors, the 
compositions of this invention may be used as anti- 
viral therapeutics to inhibit HIV binding to T4 
25 cells and virally induced syncytium formation at the 
level of receptor binding. 

This invention accomplishes these goals by 
providing DNA sequences coding on expression in an 
appropriate unicellular host for soluble T4 proteins* 
30 and soluble derivatives thereof. 



35 



* As used in this application, "soluble T4 pro- 
tein", "soluble T4" and "soluble T4-like polypeptides 
include all proteins, polypeptides and peptides which 
are natural or recombinant soluble T4 proteins, or 
soluble derivatives thereof, and which are charac- 
terized by the immuno therapeutic ( anti -retroviral ) 

(footnote continued on following page) 
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This invention also provides recombinant 
DNA molecules containing those DNA sequences and 
unicellular hosts transformed with them. Those hosts 
permit the production of large quantities of the 
5 novel soluble T4 proteins, polypeptides, peptides 
and derivatives of this invention for use in a wide 
variety of therapeutic, prophylactic and diagnostic 
compositions and methods . 

The DNA sequences of this invention are 
10 selected from the group consisting of: 

(a) the DNA inserts of pl99-7, pBG377, 
pBG380 f pBG381, p203-5, pBG391, pBG392, pBG393, 
pBG394, pBG395, pBG396, pBG397, p211-ll, p214-10 
and p215-7; 

15 (b) DNA sequences which hybridize to one 

or more of the foregoing DNA inserts and which code 
on expression for a soluble T4-like polypeptide; and 

(c) DNA sequences which code on expression 
for a soluble T4-like polypeptide coded for on 

20 expression by any of the foregoing DNA inserts and 
sequences . 

According to an alternate embodiment, this 
invention also relates to a DNA sequence comprising 
the DNA insert of pl70-2, said sequence coding on 
25 expression for a T4-like polypeptide. And, this 

invention also relates to recombinant DNA molecules 
and processes for producing T4 protein using that 
DNA sequence. 



30 (footnote continued from preceding page) 

or immunogenic activity of soluble T4 protein. They 
include soluble T4-like compounds from a variety of 
sources, such as soluble T4 protein derived from 
natural sources, recombinant soluble T4 protein and 
35 synthetic or semi -synthetic soluble T4 protein. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 is an autoradiograph depicting 
the purification of T4 protein from U937 cells by 
immunoaf f inity chromatography. 
5 Figure 2 depicts autoradiograph and Western 

blot data demonstrating that immuno affinity-purified, 
solubilized native T4 protein binds to HIV envelope 
protein. 

Figure 3 depicts the nucleotide sequence 
10 and the derived amino acid sequence of T4 cDNA 

obtained from PBL clone \203-4. In this figure, the 
amino acids are represented by single letter codes 
as follows: 

Phe: F Leu: L He: I Met: M 

15 val: V Ser: S Pro: P Thr: T 

Ala: A Tyr: Y His: H Gin: Q 

Asn: N Lys: K Asp: D Glu: E 

Cys: C Trp: W Arg: R Gly: G 



20 present. 



* = position at which a stop codon is 
In Figure 3, the T4 protein translation 



start (AA_ 23 ) is located at the methionine at nucleo- 

ides 201-203 and the mature N-terminus is located at 

the lysine (AA 3 ) at nucleotides 276-278. 
25 Figure 4 is a schematic outline of the 

construction of cDNA clones pBG312.T4 (also called 

P171-1) and pl70-2. 

Figure 5 is a schematic outline of the 

construction of plasmid pEClOO. 
30 Figure 6 depicts amino acid comparisons at 

a positions 3, 64 and 231 of various T4 cDNA clones. 

Figures 7A and 7B depict the protein domain 

structure of purified, solubilized T4 protein and 

recombinant soluble T4 mutants . 
35 Figures 8A-8D are schematic outlines of 

constructions of various intermediate plasmids and 

other plasmids used to express recombinant soluble 

T4 ("rsT4' f ) of this invention. 
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Figure 9A is a schematic outline of the 
construction of plasmid p!99-7. 

Figures 9B and 9C are schematic outlines 
of the construction of plasmid p203-5. 
5 Figure 10 depicts the synthetic oligo- 

nucleotide linkers employed in various constructions 
according to this invention. 

Figure 11 depicts the nucleotide sequence 
of the entire plasmid defined by pl99-7 (P L mutet. rsT4 ) 
10 and its rsT4.2 insert and the amino acid sequence 
deduced from the rsT4 sequence. This includes the 
Clal-Clal cassette which defines the Met perfect 
rsT4.2 coding sequence. 

Figure 12 depicts a protein blot analysis 
15 of an induction of rsT4.2 expression from 
SG936/pl99-7. 

Figure 13 is a schematic outline of the 
construction of plasmid pBG368. 

Figures 14A-14C are schematic outlines of 
20 constructions of various plasmids of this invention. 

Figure 15 depicts the nucleotide sequence 
of plasmid pBG391. 

Figure 16 depicts the nucleotide sequence 
of plasmid pBG392. In this figure, the T4 protein 
25 translation start (AA_ 23 ) is located at the methio- 
nine at nucleotides 1207-1209 and the mature 
N-terminus is located at the lysine (AA 3 ) at nucleo- 
tide 1281-84. 

Figure 17 is a schematic outline of con- 
30 structions of various plasmids of this invention. 

Figure 18 depicts the synthetic oligonucleo- 
tide linkers employed in various constructions 
according to this invention. 

Figure 19 depicts the nucleotide sequence 
35 of plasmid pBG394. 

Figure 20 depicts the nucleotide sequence 
of plasmid pBG396. 
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Figure 21 depicts the nucleotide sequence 

of plasmid pBG393. 

Figure 22 depicts the nucleotide sequence 

of plasmid pBG395. 
5 Figure 23 is a Coomassie stained gel of 

rsT4.2 purified from the conditioned medium of the 
pBG380 transfected CHO cell line BG380G of plasmid 
pl96-10. 

Figure 24 is a schematic outline of the 
10 construction of plasmid pl96-10. 

Figure 25 is a schematic outline of the 
construction of plasmid pBG394. 

Figure 26 is a schematic outline of the 
construction of plasmid p211-ll. 
15 Figure 27 is a schematic outline of the 

construction of plasmid p2l5-7. 

Figure 28 is a schematic outline of the 
construction of plasmid p218-8. 

Figure 29A is a Coomassie stained gel of 
20 rsT4. 113.1 purified from the conditioned medium of 
pBG2 11-11 transfected E.coli . 

Figure 29B is an autoradiograph depicting 
a Western blot analysis of rsT4. 113.1 expressed in 
E . coli . 

25 Figure 30, panels (a)-(c) depict the puri- 

fication of rsT4. 113.1 from E.coli trans formants . 

Figure 31, panels (a)-(c) depict the 
refolding of purified rsT4. 113.1. 

Figure 32 is an autoradiograph depicting 
30 the immunoprecipitation of 35 s-metabolically labelled 
CHO cell lines producing recombinant soluble T4 . 

Figure 33 depicts an immunoblot analysis 
of COS 7 cell lines producing recombinant soluble T4. 
Figure 34 depicts in graphic form the 
35 results of a competition assay between rsT4. 113.1 
and rsT4.3 for binding to OKT4A or 0KT4 . 
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Figures 35-37 depict in graphic form the 
results of competition assays between rsT4.111 and 
rsT4.3 for binding to, respectively, 0KT4A, Leu-3A 
and 0KT4. 

5 Figure 38 depicts in graphic form an ELISA 

assay for rsT4. 113.1 from E.coli trans fo rmants . 

Figure 39 depicts in graphic form the 
results of a p24 radioimmunoassay using recombinant 
soluble T4 according to this invention. 
10 Figures 40 and 41 depict the results of 

syncytia inhibition assays using recombinant soluble 
T4 proteins according to this invention. 

Figure 42 is a schematic outline of the 
construction of plasmid pBiv.l. 
15 Figure 43 depicts the bivalent recombinant 

soluble T4 protein produced by pBiv.l. 

DETAILED DESCRIPTION OF THE INVENTION 

We isolated the DNA sequences of this 
invention from two libraries: a \gt cDNA library 

20 derived the T cell tumor line REX and a \gtl0 cDNA 
library derived from peripheral blood lymphocytes . 
However, we could also have employed libraries pre- 
pared from other cells that express T4. These 
include, for example, H9 and U937. We also used a 

25 human genomic bank to isolate various fragments of 
the T4 gene. 

For screening these libraries, we used a 
series of chemically synthesized anti-sense oligo- 
nucleotide DNA probes based upon the T4 protein 

30 sequence set forth in Maddon et al. (1985), supra. 

For screening, we hybridized our oligo- 
nucleotide probes to our cDNA libraries utilizing a * 
plaque- hybridization screening assay. We selected 
clones hybridizing to several of our probes. And, 

35 after isolating and subcloning the cDNA inserts of 
the selected clones into plasmids, we determined 
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their nucleotide sequences and compared the amino 
acid sequences deduced from those nucleotide sequences 
to the amino acid sequences referred to in Maddon 
e t al. (1985), supra. As a result of these compari- 
5 sons, we determined that all of our selected clones 
were characterized by cDNA inserts coding for amino 
acid sequences of human T4. 

We have depicted in Figure 3 the nucleo- 
tide sequence of full-length T4 cDNA obtained from 

10 deposited clone pl70-2 and the amino acid sequence 
deduced therefrom. That cDNA sequence was subse- 
quently subjected to in vitro site-directed mutagen- 
esis and restriction fragment substitution so that 
its cDNA sequence was identical to that of Maddon 

15 et al. 

After modifying our T4 cDNA sequence to be 
identical to that of Maddon et al . , we truncated 
samples of it in various positions to remove the 
coding regions for the transmembrane and intracyto- 

20 plasmic domains. The remaining cDNA sequences encoded 
a soluble T4 which retained the extracellular region 
believed to be responsible for HIV binding. 

We then constructed various clones charac- 
terized by such cDNA inserts coding for human soluble 

25 T4. Those cDNA sequences may be used in a variety 
of ways in accordance with this invention. More 
particularly, those sequences or portions of them, 
or synthetic or semi -synthetic copies of them, may 
be used as DNA probes to screen other human or animal 

30 cDNA or genomic libraries to select by hybridization 
other DNA sequences that are related to soluble T4. 
Typically, conventional hybridization conditions, 
e.g., about 20° to 27°C below Tm, are employed in 
such selections. However, less stringent conditions 

35 may be necessary when the library is being screened 
with a probe from a different species than that from 



WO 89/01940 



PCT/US88/02940 



which the library is derived, e.g., the screening of 
a mouse library with a human probe. 

Such cDNA inserts, portions of them, or 
synthetic or semi-synthetic copies of them, may also 
5 be used as starting materials to prepare various 

mutations . Such mutations may be either degenerate , 
i.e., the mutation does not change the amino acid 
sequence encoded by the mutated codon, or non- 
degenerate, i.e., the mutation changes the amino 

10 acid sequence encoded by the mutated codon. Both 

types of mutations may be advantageous in producing 
or using soluble T4's according to this invention. 
For example, these mutations may permit higher levels 
of production or easier purification of soluble T4 

15 or higher T4 activity. 

For all of these reasons, the DNA sequences 
of this invention are selected from the group con- 
sisting of: 

(a) the DNA inserts of pl99-7, pBG377, 

20 pBG380, pBG381, p203-5, pBG391, pBG392, pBG393, pBG394, 
pBG395, pBG396, pBG397, p211-ll, p214-10 and p215-7; 

(b) DNA sequences which hybridize to one 
or more of the foregoing DNA inserts and which code 
on expression for a soluble T4-like polypeptide; and 

25 (c) DNA sequences which code on expres- 

sion for a soluble T4-like polypeptide coded for on 
expression by any of the foregoing DNA inserts and 
sequences . 

Preferably, the DNA sequences of this 
30 invention code for a polypeptide selected from the 
group consisting of a polypeptide of the formula 
AA_ 23 -AA 36 2 of Figure 3, a polypeptide of the formula 
AA 1 _ 362 of Figure 3, a polypeptide of the formula 
Met-AA 1 _ 362 of Figure 3, a polypeptide of the formula 
35 AA 1 _ 374 of Figure 3, a polypeptide of the formula 

Met-AA 1 _ 374 of Figure 3, a polypeptide of the formula 
AA, of Figure 3, a polypeptide of the formula 
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Met-AA 1-377 of Figure 3, a polypeptide of the formula 
AA 2 3 -AA 374 of 3 ' a PolyP e P tide of formula 

AA 23 -AA 377 of Figure 3 ' or P ortions thereof. 

DNA sequences according to this invention 
5 also preferably code for a polypeptide selected from 
the group consisting of a polypeptide of the formula 
AA. 23 -AA 1Q2 of Figure 16, a polypeptide of the 
formula AA-^AA^ of Figure 16, a polypeptide of 
the formula Met-AA 1 _ 182 of Figure 16, a polypeptide 
10 of the formula AA_ 23 -AA 1Q2 of Figure 16, followed by 

the amino acids asparagine-leucine-glutamine-histidine- 
serine-leucine, a polypeptide of the formula 
AA-.-AA-.o- of Figure 16, followed by the amino acids 

1 182 

asparagine-leucine-glutamine-histidine-serine-leucane, 

15 a polypeptide of the formula Met-AA 1-182 of Figure 16, 
followed by the amino acids asparagine-leucine- 
glutamine-histidine-serine-leucine, a polypeptide of 
the formula AA_ 23 -AA 113 of Figure 16, a polypeptide 
of the formula AA 1 -AA 113 of Figure 16, a polypeptide 

20 of the formula Met-AA^^ of Figure 16, a polypeptide 
of the formula AA_ 23 -AA 111 of Figure 16, a polypeptide 
of the formula AAj-AA^ of Figure 16, a polypeptide 
of the formula Met-AA 1-1;L1 of Figure 16, a polypep- 
tide of the formula AA_ 23 -AA 131 of Figure 16, a poly- 

25 peptide of the formula AA 1 ~AA 131 of Figure 16, a 

polypeptide of the formula Met-AA 1 _ 131 of Figure 16, 
a polypeptide of the formula AA _ 2 3" AA 145 ° f Figure 16 ' 
a polypeptide of the formula AA 1 _AA 1 45 of Fi ? ure 16 ' 
a polypeptide of the formula Met-AA 1-145 of Figure 16, 

30 a polypeptide of the formula AA„ 2 3' AA 166 of 16 ' 

a polypeptide of the formula AA^AA-^g of Figure 16, 
a polypeptide of the formula Met-AA^^g of Figure 16, 
or portions thereof. 

Additionally, DNA sequences of this inven- 

35 tion code for a polypeptide selected from the group 

consisting of a polypeptide of the formula AA _23~ AA 362 
of mature T4 protein, a polypeptide of the formula 
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^^1-362 of mature T4 protein, a polypeptide of the 
formula Met-AA 1 _ 3g2 of mature T4 protein, a polypep- 
tide of the formula AA 1-3?4 of mature T4 protein, a 
polypeptide of the formula Met-AA 1-374 of mature T4 
5 protein, a polypeptide of the formula AA 1 _ 377 of 
mature T4 protein, a polypeptide of the formula 
Met-AA 1 _ 3?7 of mature T4 protein, a polypeptide of 
the formula AA_ 23 -AA 3?4 of mature T4 protein, a poly- 
peptide of the formula AA_ 23 -AA 377 of mature T4 pro- 

10 tein, or portions thereof. 

DNA sequences according to this invention 
also code for a polypeptide selected from the group 
consisting of a polypeptide of the formula AA_ 23 ~AA 182 
of mature T4 protein, a polypeptide of the formula 

15 AA 1 -AA 182 of mature T4 protein, a polypeptide of the ' 
formula Met-AA 1-182 of mature T4 protein, a polypep- 
tide of the formula A A _ 2 3" AA ia2 of mature T4 protein, 
followed by the amino acids asparagine-leucine- 
glutamine-histidine-serine-leucine, a polypeptide of 

20 the formula AA 1 -AA 1S2 of mature T4 protein, followed 
by the amino acids asparagine-leucine-glutamine- 
histidine-serine-leucine, a polypeptide of the 
formula Met-AA 1 _ 1Q2 of mature T4 protein, followed 
by the amino acids asp aragine- leucine -glutamine- 

25 hi stidine-serine- leucine, a polypeptide of the 

formula AA_ 23 «AA 113 of mature T4 protein, a polypep- 
tide of the formula AA 1 -AA 113 of mature X4 protein, 
a polypeptide of the formula Met-AA 1 _ 113 of mature 
T4 protein, a polypeptide of the formula ^.23*^111 

30 of mature T4 protein, a polypeptide of the formula 

AA^-AA^^ of mature T4 protein, a polypeptide of the 
formula Met-AA^_^^ of mature T4 protein, a polypep- 
tide of the formula AA_ 2 3" AA 131 of mature T4 protein, 
a polypeptide of the formula AA 1 ~AA 131 of mature T4 

35 protein, a polypeptide of the formula Met-AA 1-131 of 
mature T4 protein, a polypeptide of the formula 
AA 23 -AA 145 of mature T4 protein, a polypeptide of 
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the formula AA X -AA 145 Q f mature T4 protein, a polypep- 
tide of the formula Met-AA 1-145 of mature T4 protein, 
a polypeptide of the formula AA_ 23 -AA 166 of mature 
T4 protein, a polypeptide of the formula AA^AA^g 
5 of mature T4 protein, a polypeptide of the formula 

Met-AA 1 _ 166 of mature T4 protein, or portions thereof. 

The amino terminal amino acid of mature 
T4 protein isolated from T cells begins at lysine, 
the third amino acid of the sequence depicted in 

10 Figure 16. Accordingly, soluble T4 proteins also 
include polypeptides of the formula AA 3 -AA 37? of 
Figure 16, or portions thereof. Such polypeptides 
include polypeptides selected from the group consist- 
ing of a polypeptide of the formula AA 3 to AA 362 of 

15 Figure 16, a polypeptide of the formula AA 3 to AA 3?4 
of Figure 16, a polypeptide of the formula AA 3 -AA 182 
of Figure 16, a polypeptide of the formula AA 3 -AA 113 
of Figure 16, a polypeptide of the formula AA 3 -AA 131 
of Figure 16, a polypeptide of the formula AA 3 -AA 145 

20 of Figure 16, a polypeptide of the formula AA 3 ~AA 166 
of Figure 16, and a polypeptide of the formula 
AA 3 -AA 11]L of Figure 16. Soluble T4 proteins also 
include the above-recited polypeptides preceded by 
an N- terminal methionine group. 

25 Soluble T4 protein constructs according to 

this invention may also be produced by truncating 
the full length T4 protein sequence at various posi- 
tions to remove the coding regions for the transmem- 
brane and intracytoplasmic domains, while retaining 

30 the extracellular region believed to be responsible 
for HIV binding. More particularly, soluble T4 
polypeptides may be produced by conventional tech- 
niques of oligonucleotide directed mutagenesis; 
restriction digestion, followed by insertion of 

35 linkers; or chewing back full length T4 protein with 
enzymes . 
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Altematively, soluble T4 polypeptides may 
be chemically synthesized by conventional peptide 
synthesis techniques, such as solid phase synthesis 
[R. B. Merri field, "Solid Phase Peptide Synthesis. 
5 I. The Synthesis Of A Tetrapeptide" , J. Am. Chem. 
Soc. , 83, pp. 2149-54 (1963)]. 

The DNA sequences of this invention code 
for soluble proteins and derivatives that are believed 
to bind to Major Histocompatibility Complex antigens 
10 and envelope glycoprotein of certain retroviruses, 

such as HIV. Preferably, they also inhibit syncytium 
formation, believed to be the mode of intracellular 
HIV virus spread. And, they may inhibit interaction 
between T4 + lymphocytes and antigen-presenting cells 
15 and targets of T4 + cell mediated killing. Most 

preferably, they also inhibit adhesion between T4 + 
lymphocytes and infective agents, such as the HIV 
virus, whose primary targets are T4 + lymphocytes. 

The DNA sequences of this invention are 
20 also useful for producing soluble T4 or its deriva- 
tives coded for on expression by them in unicellular 
hosts transformed with those DNA sequences. As well 
known in the art, for expression of the DNA sequences 
of this invention, the DNA sequence should be opera- 
25 tively linked to an expression control sequence in 
an appropriate expression vector and employed in 
that expression vector to transform an appropriate 
unicellular host. 

Such operative linking of a DNA sequence 
30 of this invention to an expression control sequence, 
of course, includes the provision of a translation 
start signal in the correct reading frame upstream 
of the DNA sequence. If the particular DNA sequence 
of this invention being expressed does not begin 
35 with a methionine, the start signal will result in 
an additional amino acid — methionine — being 
located at the N- terminus of the product. While 
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such me thionyl- containing product may be employed 
directly in the compositions and methods of this 
invention, it is usually more desirable to remove 
the methionine before use. Methods are available in 
5 the art to remove such N-terminal methionines from 
polypeptides expressed with them. For example, 
certain hosts and fermentation conditions permit 
removal of substantially all of the N-terminal 
methionine in vivo . Other hosts require in vitro 
10 removal of the N-terminal methionine. However, such 
in vivo and in vitro methods are well known in the 
art. 

A wide variety of host/expression vector 
combinations may be employed in expressing the DNA 

15 sequences of this invention. Useful expression 
vectors, for example, may consist of segments of 
chromosomal, non-chromosomal and synthetic DNA 
sequences, such as various known derivatives of SV40 
and known bacterial plasmids, e.g., plasraids from 

20 E.coli including col El, pCRl, pBR322, pMB9 and their 
derivatives, wider host range plasmids, e.g., RP4, 
phage DNAs, e.g., the numerous derivatives of phage k, 
e.g., NM989, and other DNA phages, e.g., M13 and 
filamenteous single stranded DNA phages, yeast plas- 

25 mids, such as the 2m plasmid or derivatives thereof, 
and vectors derived from combinations of plasmids 
and phage DNAs , such as plasmids which have been 
modified to employ phage DNA or other expression 
control sequences. For animal cell expression, we 

30 prefer to use plasmid pBG368, a derivative of pBG312 
[R. Cate et al., "Isolation Of The Bovine And Human 
Genes For Mullerian Inhibiting Substance And 
Expression Of The Human Gene In Animal Cells", Cell, 
45, pp. 685-98 (1986)] which contains the major late 

3 5 promoter of adenovirus 2. 

In addition, any of a wide variety of 
expression control sequences — sequences that con- 
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txol the expression of a DNA sequence when opera- 
tively linked to it — may he used in these vectors 
to express the DNA sequence of this invention. Such 
useful expression control sequences, include, for 
5 example, the early and late promoters of SV40 or the 
adenovirus, the lac system, the trp system, the TAC 
or TRC system, the major operator and promoter regions 
of phage \, the control regions of fd coat protein, 
the promoter for 3-phosphoglycerate kinase or other 
10 glycolytic enzymes, the promoters of acid phosphatase, 
e.g., Pho5, the promoters of the yeast o-mating 
factors, the polyhedron promoter of the baculovirus 
system and other sequences known to control the 
expression of genes of prokaxyotic or eukaryotic 
15 cells or their viruses, and various combinations 

thereof. For animal cell expression, we prefer to 
use an expression control sequence derived from the 
major late promoter of adenovirus 2. 

A wide variety of unicellular host cells 
20 are also useful in expressing the DNA sequences of 
this invention. These hosts may include well known 
eukaryotic and prokaryotic hosts, such as strains of 
E.coli , Pseudomonas , Bacillus , Streptomyces , fungi, 
such as yeasts, and animal cells, such as CHO and 
25 mouse celis, African green monkey cells, such as 

COS 1, COS 7, BSC 1, BSC 40, and BMT 10, insect cells, 
and human cells and plant cells in tissue culture. 
For animal cell expression, we prefer CHO cells and 
COS 7 cells. 

30 It should of course be understood that not 

all vectors and expression control sequences will 
function equally well to express the DNA sequences 
of this invention. Neither will all hosts function 
equally well with the same expression system. How- 

35 ever, one of skill in the art may make a selection 
among these vectors, expression control sequences, 
and hosts without undue experimentation and without 
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departing from the scope of this invention. For 
example, in selecting a vector, the host must be 
considered because the vector must replicate in it. 
The vector's copy number, the ability to control 
5 that copy number, and the expression of any other 
proteins encoded by the' vector, such as antibiotic 
markers, should also be considered. 

in selecting an expression control sequence, 
a variety of factors should also be considered. 

10 These include, for example, the relative strength of 

the system, its controllability, and its compatibility 
with the particular DNA sequence of this invention, 
particularly as regards potential secondary struc- 
tures. Unicellular hosts should be selected by 

15 consideration of their compatibility with the chosen 
vector, the toxicity of the product coded for on 
expression by the DNA sequences of this invention to 
them, their secretion characteristics, their ability 
to fold proteins correctly, their fermentation re- 

20 quirements, and the ease of purification of the 

products coded on expression by the DNA sequences of 
this invention. 

Within these parameters, one of skill in 
the art may select various vector/expression control 

25 system/host combinations that will express the DNA 

sequences of this invention on fermentation or in large 
scale animal culture, e.g., CHO cells or COS 7 cells. 

The polypeptides produced on expression of 
the DNA sequences of this invention may be isolated 

30 from the fermentation or animal cell cultures and 
purified using any of a variety of conventional 
methods. One of skill in the art may select the 
most appropriate isolation and purification tech- 
niques without departing from the scope of this 

35 invention. 

The polypeptides produced on expression of 
the DNA sequences of this invention are essentially 
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free of other proteins of human origin. Thus, they 
are different than T4 protein purified from human 
lymphocytes . 

The polypeptides of this invention are , 

5 useful in immuno therapeutic compositions and methods - 
For example, the polypeptides of this invention are 
active in inhibiting infection by agents whose primary 
targets are T4 + lymphocytes by interfering with their 
interaction with those target lymphocytes. More 
10 preferably, the polypeptides of this invention may 

be employed to saturate the T4 receptor sites of T4- 
targeted infective agents . Thus , they exert anti- 
viral activity by competitive binding with cell 
surface T4 receptor sites. This effect is plainly 

15 of great utility in diseases, such as AIDS, ARC and 
HIV infection. Accordingly, the polypeptides and 
methods of this invention may be used to treat humans 
having AIDS , ARC, HIV infection or antibodies to 
HIV. In addition, these polypeptides and methods 

20 may be used for treating AIDS-like diseases caused 
by retroviruses, such as simian immunodeficiency 
viruses, in mammals, including h uman s. 

According to one embodiment of this inven- 
tion, antibodies to soluble T4 proteins and polypep- 

25 tides may be used in the treatment, prevention, or 
diagnosis of AIDS, ARC and HIV infection. 

The polypeptides of this invention may 
also be used in combination with other therapeutics 
used in the treatment of AIDS, ARC and HIV infection. 

30 For example, soluble T4 polypeptides may be used in 
combination with anti-retroviral agents that block 
reverse transcriptase, such as A2T, HPA-23 , phos- 
phonoformate, suramin, ribavirin and dideoxyciti- 
dine. Additionally, these polypeptides may be used 

35 with anti -viral agents such as interferons, includ- 
ing alpha interferon, beta interferon and gamma 
interferon, or glucosidase inhibitors, such as 
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castanospermine. Such combination therapies advan- 
tageously utilize lower dosages of those agents, 
thus avoiding possible toxicity. 

And, the polypeptides of this invention 
5 may be used in plasmapheresis techniques or in blood 
bags for selective removal of viral contaminants 
from blood. According to this embodiment of the 
invention, soluble T4 polypeptides may be coupled to 
a solid support, comprising, for example, plastic or 

10 glass beads, or a filter, which is incorporated into 
a plasmapheresis unit. 

Additionally, the compositions of this 
invention may be employed as immunosuppressants use- 
ful in preventing or treating graft-vs-host disease, 

15 autoimmune diseases and allograft rejection. 

The compositions of this invention typi- 
cally comprise an immunotherapeutic effective amount 
of a polypeptide of this invention and a pharmaceu- 
tical^ acceptable carrier. Therapeutic methods of 

20 this invention comprise the step of treating patients 
in a pharmaceutically acceptable manner with those 
compositions . 

The compositions of this invention for use 
in these therapies may be in a variety of forms. 

25 These include, for example, solid, semi-solid and 

liquid dosage forms, such as tablets, pills, powders, 
liquid solutions or suspensions, liposomes, supposi- 
tories, injectable and infusable solutions. The 
preferred form depends on the intended mode of admin- 

30 istration and therapeutic application. The composi- 
tions also preferably include conventional pharma- 
ceutically acceptable carriers and adjuvants which 
are known to those of skill in the art. 

Generally, the pharmaceutical compositions 

35 of the present invention may be formulated and admin- 
istered using methods and compositions similar to 
those used for other pharmaceutically important poly- 
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peptides (e.g., alpha- inter fer on ) . Thus, the poly- 
peptides may be stored in lyophilized form, reconsti- 
tuted with sterile water just prior to administration, 
and administered by the usual routes of administration 
5 such as parenteral, subcutaneous, intravenous, intra- - 
muscular or intralesional routes. An effective dosage 
may be in the range of from 0.5 to 5.0 mg/kg body , 
weight/day, it being recognized that lower and higher 
doses may also be useful. 
10 This invention also relates to soluble 

receptors and their use in diagnosing or treating 
viral agents which target or bind to those receptors. 
Such soluble receptors may be used as decoys to 
absorb viral agents and to halt the spread of viral 
15 infection. Alternatively, virus-killing agents may 
be attached to the soluble protein receptors, 
providing a direct mode of delivery of those agents 
to the virus. 

More particularly, the polypeptides of 
20 this invention are useful in diagnostic compositions 
and methods to detect or monitor the course of HIV 
infection. Advantageously, these polypeptides are 
useful in diagnosing variants of the HIV virus, 
regardless of origin of the infecting HIV agent. 
25 For example, soluble T4 proteins and poly- 

peptides according to this invention, which have a 
high affinity for HIV, may be advantageously used to 
increase the sensitivity of HIV assay systems now 
based upon monoclonal or polyclonal antibodies . 
30 More specifically, soluble T4 proteins and polypep- 
tides may be used to pretreat test plasma to concen- 
trate any HIV present, even in small amounts, so 
that it is more easily recognized by the antibody. 
And soluble T4 proteins and polypeptides may be used 
35 to purify the HIV envelope protein gpl20. 

Alternatively, the soluble T4 proteins and 
polypeptides of this invention may be used to replace 
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anti-HIV antibodies now used in various assays. 
These soluble T4 proteins and polypeptides are be 
preferable to anti-HIV antibodies for two reasons. 
First, soluble T4, exhibits an affinity for HIV of 
5 approximately 10" 9 , a level which exceeds the 10" 
to 10" 8 values of anti-HIV antibodies. And, while 
anti-HIV antibodies are more likely to be specific 
for different HIV isolates, strain variations would 
not affect a soluble T4 protein-based assay, since 

10 all HIV isolates must be capable of interacting with 
the T4 receptor as a prerequisite to infectivity. 

For example, a soluble T4 protein or poly- 
peptide may be linked to an indicator, such as an 
enzyme, and used in an ELISA assay. Here, soluble 

15 T4 advantageously acts as a measure of both HIV in a 
test sample and any free HIV envelope gpl20 protein. 

And, polyvalent forms of soluble T4 proteins 
or polypeptides may be produced, for example, by 
chemical coupling or genetic fusion techniques, thus 

20 increasing even further the avidity of soluble T4 
for HIV. 

In order that this invention may be better 
understood, the following examples are set forth. 
These examples are for purposes of illustration only, 
25 and are not to be construed as limiting the scope of 
the invention in any manner. 

EXAMPLES 

Purification Of Native Solubilized T4 

We purified native T4 from the T4 + -promono- 
30 cytic cell line U937 derived from a histocytic 

lymphoma to approximately 50% purity usir.g immuno- 
affinity chromatography as follows. 

We grew U937 cells [a gift from Dr. Scott 
Hammer, New England Deaconess Hospital] to 
35 10 6 cells/ml in RFMI 1640, 10% FCS, harvested and 

washed them in IX PBS. We then lysed the cell pellet 
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in 20 mM Tris-HCl (pH 7.7), 0.5% NP-40 (a non-ionic 
detergent), 0.2% NaDOC, 0.2 mM EGTA, 0.2 mM FMSF and 
5 ug/ml BPTI at 4 x 10 7 cells/ml. Because this 
purification was carried out in the presence of a 

5 non-ionic detergent, T4, which is normally membrane- 
bound via its hydrophobic transmembrane domain, was 
isolated as a solubilized protein. We spun the lysate 
in a GS 3 rotor for 10 min at 10,000 rpm and stored 
the supernatant at -70 °C. 

10 Subsequently, we preabsorbed the clarified 

cell extract with mouse IgG-Sepharose, followed by 
protein A Sepharose and then passed the flowthrough 
through an immunoaf finity column comprising immobil- 
ized 19Thy anti-T4 monoclonal antibody on Affigel-10 

15 [a gift from Dr. Ellis Reinherz, Dana Farber Cancer 
Institute, Boston, Massachusetts] . We washed the 
column extensively and eluted the bound material 
with 50 mM glycine-HCl (pH 2.5), 0.15 M NaCl, 0.5% 
NP-40, 5 \iq/ml BPTI and 0.2 mM EGTA. 

20 We then separated 10 jjI aliguots of each 

elution fraction on a 10% SDS-PAGE under reducing 
conditions, with the bands being visualized by silver 
staining. As shown in Figure 1, a major silver- 
stained band of 55 Kd was visible. We then carried 

25 out two assays on the 55 Kd protein and sequenced 
the amino terminus of the protein to confirm its 
identity as native solubilized T4. 

Sequencing Of Native Solubilized T4 

We determined the N- terminal amino acid 
30 sequence of our solubilized native T4 which we 

isolated from a detergent extract of U937 cells by 
immunoaffinity chromatography as described above. 

Techniques for determining the amino acid 
sequences of various proteins and peptides derived 
35 from them are well known in the art. We chose auto- 
mated Edman degradation to determine the amino 
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texminus of our solubilized native T4. More speci- 
fically, we gel purified and electroeluted approxi- 
mately 5 pg of the solubilized native T4 and then 
subjected it to automated Edman degradation using a 
5 gas phase sequencer (Applied Biosystems 470A) . We 
then identified the PTH-amino acids produced at each 
cycle of the Edman chemistry by high pressure liquid 
chromatography, on-line with the sequencer, in a 
PTH-amino acid analyzer (Applied Biosystems 120A). 
10 Direct analysis of the protein provided amino terminal 
sequence information which, when compared to the 
amino acid sequence deduced from the cDNA sequence 
of human T4 F Maddon et al. (1985), supra], identified 
the purified protein as human T4. 

15 Radioimmunoassay Of Native S olubilized T4 

To determine that our purification process 
enriched for T4, we assayed fractions from the 
immunoaffinity elution step in a T4-specific sandwich 
radioimmunoassay, based upon the ELISA assay of P. E. 

20 Rao et al . , in Cellular Immunology , 80, pp. 310-19 
(1983). We coated each well of a Removawell strip 
(Dynatech Labs, Alexandria, Virginia) with 50 pi of 
10 pi/ml 0KT4 antibody (ATCC #CRL 8002) or MOPC195 
(a background binding control) in 0.05 M sodium 

25 bicarbonate buffer (pH 9.4) at 4°C overnight. We 

washed the wells and then filled them with 1% FCS in 
PBS to saturate the protein binding capacity of the 
plastic. After removing the 1% FCS solution, we 
added test samples, in 50 ul aliquots, to the wells. 

30 We then incubated the samples for 4 hours at room 
temperature. Subsequently, we removed the samples 
and washed the wells four times with 0.05% Tween-20 
in PBS . We then added 125 I-labelled 19Thy antibody 
(50,000-100,000 cpm per well) and incubated the wells 

35 at 4°C overnight. We then washed the wells four 
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times and separated each well for bound I detection 
in a Beckman gamma detector. 

As shown in Figure 1, in which values were 
plotted following subtraction for background, the 
5 peak fraction of solubilized native T4 protein * 
detected by radioimmunoassay coincided with elution 
of the 55 Kd protein seen by silver staining. , 

Western Blot Assay For T4 

Although many antibodies have been developed 
10 for detecting T4 antigen, none are useful for protein 
blot analysis (Dr. Ellis Reinherz, personal communi- 
cation). In order to develop antibodies useful for 
Western blot detection of soluble T4 to follow the 
purification of T4 and recombinant soluble T4, we 
15 raised polyclonal, hyperimmune anti-T4 antisera in 
rabbits against three synthetic T4 oligopeptides. 
These oligopeptides are represented in Figure 3 as 
follows: 

Oligopeptide Amino Acid Coordinates 
20 JB-1 44-63 

JB-2 133-156 
JB-3 325-343 
We had previously synthesized these peptides using 
conventional phosphoamide DNA synthesis techniques. 
25 See, e.g., Tetrahedron Letters , 22, pp. 1859-62 

(1981). We synthesized the peptides on an Applied 
Biosys terns 3 8 OA DNA Synthesizer and purified them by 
gel electrophoresis. 

(i) Coupling Of T4 Peptides To BTG ? 

30 We coupled each of these peptides to the 

carrier protein bovine thyrogobulin ("BTG") [Sigma, 
St. Louis, Missouri] according to a modification of 
procedures set forth in J. Rothbard et al., J. Exp. 
Med. , 160, pp. 208-21 (1984) and R. C. Kennedy et al . , 

35 "Antiserum To A Synthetic Peptide Recognizes The 
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HTLV-III Envelope Glycoprotein", Science , 231, 
pp. 1556-59 (1986). 

More specifically, we mixed 10 mg of BTG 
diluted in 1 ml of PBS with 1.3 mg of m-maleimido- 
5 benzoyl -N-hydroxysuccinimide ester ("MBS") in 0.5 ml 
of dimethylformamide ( "DMF" ) . We mixed the reaction 
mixture well and reacted it for about 1 hour at 25°c. 
Subsequently, we loaded the mixture onto a Sephadex 
G25 gel filtration column (Pharmacia, Sweden) which 

10 had been pre-equilibrated with 0.1 M PBS (pH 6.0). 
We then collected a total of thirty 2 ml aliquot 
elution fractions and read the absorbance of each 
fraction at 280 nm ( U * 2Q0 U ). We then pooled the 
three peak fractions (15, 16 and 17) to create the 

15 activated carrier. 

We dissolved 10 mg of NaBH 4 in 2.5 ml of 
0.1 M sodium borate solution to produce a sodium 
borohydride solution. Subsequently, we diluted 
approximately 8 mg of each of synthetic T4 peptides 

20 JB-1, JB-2 and JB-3 with 1 ml of 0.1 M borate buffer 

and then mixed each solution with 200 pi of the sodium 
borohydride solution, incubating the mixture on ice 
for 5 minutes. We then warmed each peptide solution 
to 25°C, brought each solution to pH 1.0 with 1 N 

25 HCl (during which frothing occurred) and then brought 
each solution to pH 7.0 with 1 N NaOH (after the 
frothing had stopped) . 

We then coupled each peptide to BTG by 
adding 1.2 ml of the peptide solution to 6 ml of the 

30 activated carrier solution. We allowed the coupling 
reaction to proceed overnight by incubating the reac- 
tion mixture at room temperature. 

(ii) inoculation Of Test Animals 

We dissolved each of the BTG-coupled pep- 
35 tides prepared above in sterile Freund's complete 

adjuvant, to a final concentration of 1 pg/ml coupled 
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peptide in PBS. Subsequently, we inoculated each of 
three rabbits (New Zealand white) by intramuscular 
injection of 500 pg of one of the coupled peptides 
into each rabbit. We inoculated a fourth rabbit 
5 (New Zealand white) in the same manner with a mixture 
of the three coupled peptides. All rabbits were 
prebled prior to boosting to establish an average 
baseline for each response to be measured. The 
rabbits were boosted at 6 weeks with 500 pg coupled 
10 peptide in incomplete Freund's adjuvant. 

Serum was collected from each rabbit monthly 
for 4 months after immunization. The serum was then 
assayed for antipeptide titer. 

(iii) ELISA With Antipeptide Sera 
15 Against Peptide Coated Plates 

In this assay, we determined that antiserum 
raised in an animal by each of peptides JB-1, JB-2 
and JB-3 binds to that peptide. Accordingly, those 
peptides are immunogenic and elicit a response in 

20 test animals. 

To carry out the assay, we coated Immulon-2 
(Dynatech Labs, Alexandria, Virginia) microti ter 
plates with 50 jjI per well of 50 pg/ml uncoupled 
peptide in PBS and incubated the plates overnight at 

25 4°C. Plates coated with peptide 46R*, which served 

as controls, were treated identically. We then washed 
the plates 4 times with PBS-Tween (0.5%) and 4 times 
with water. The plates were blotted dry by gentle 
tapping over paper towels. After blotting the plates, 



* Peptide 46 corresponds to amino acids ( "AA" ) 
728-751 of the env gene of the HIV genome. The amino 
acid numbering corresponds to that set forth for the 
env gene in L. Ratner et al., "Complete Nucleotide 
35 Sequence Of The AIDS Virus, HTLV-III", Nature , 313, 
pp. 277-84 (1985). Peptide 46 has the sequence: 
LP IPRGPDRPEGIEEEGGERDRDR . 



WO 89/01540 



PCT/US88/02940 



-33- 

we added 200 ul of a 5% FCS/PBS solution to each 
well and incubated the plates for 1 hour at room 
temperature . 

We then assayed serum samples from the 
5 rabbits on the pre-coated plates prepared as described 
above. We assayed the antibody response to the 
immunogen peptide at an initial dilution of 1:100, 
followed by serial 10-fold dilutions in 5% FCS/PBS. 

After a 2 hour incubation period at room 

10 temperature, we washed the plates and blotted them 
dry as described above. We then added 50 ul of a 
1:1500 dilution of horseradish peroxidase ( "HRP" ) - 
conjugated goat anti-rabbit-IgG [Cooper Biomedical, 
Malvern, Pennsylvania] in 5% FCS/PBS to each well 

15 and incubated the plates at room temperature for 

1 hour. We washed the plates with PBS-Tween 0.5%. 
We then added 50 pi of 0.42 mM TMB. We stopped the 
enzyme reactions with 50 pi of 2 M H 2 S0 4 - We then 
analyzed the plates spectrophotometrically at 450 nm 

20 using a microtiter "plate reader [Dynatech Labs, 
Alexandria, Virginia]. 

We observed that antiserum against each of 
peptides JB-1, JB-2 and JB-3 binds to the corre- 
sponding peptide. We also observed that antiserum 

25 against a mixture of peptides JB-1, JB-2 and JB-3 

binds to peptides JB-1 and JB-3 under the conditions 
set forth above. The titers of each of the four 
antisera tested against the peptides in the solid- 
phase ELISA are shown below, where "ND" represents 

30 values not determined: 

Approximate Titer Against: 

Peptide JB-1 JB-2 JBrl 

JB-1 >l/50,000 0 ND 

jB-2 0 1/50,000 ND 

35 JB-3 0 0 1/10,000 

JB-1 + JB-2 + JB-3 1/4,000 ND 1/7,000 
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Ig fractions from two of the three anti- 
peptide sera raised against individual peptides, 
anti-JB-1 and anti-JB-2, recognized the 55 Kd T4 
antigen band of native solubilized T4 in a Western 
5 blot analysis of protein eluted from the 19Thy * 
(anti-T4) monoclonal antibody affinity column 

described above. As in the case of the radioimmuno- * 
assay of native solubilized T4, the detection of the 
55 Kd protein coincides with its apparent elution 

10 from the affinity column- This provides further 

evidence that our T4 purification procedure enriched 
for solubilized T4. 

Thus, these polyclonal sera are useful in 
the detection of nanogram quantities of T4 (both 

15 native and recombinant forms) by Western analysis. 

Binding of Cell-Free T4 To HIV Envelope 

We then tested our purified solubilized 
native T4 isolated from U937 cells for its ability 
to bind to the HIV envelope protein gpl60/gpl20. To 

20 carry out this direct binding assay, we incubated 
35 S-labelled gpl60/gp!20 detergent cell extract 
derived from a recombinant cell line 7d2 (a gift 
from Drs. Mark Kowalski and William Haseltine, Dana- 
Farber cancer Institute) with samples of solubilized 

25 native T4, each of which had been preincubated with 
one typ e °f monoclonal antibody. 

More specifically, we mixed 5 pi of solu- 
bilized T4 in a microfuge tube with 5 yg (about 3 pi) 
of 0KT4 (ATCC #CRL 8002 ) , a monoclonal antibody 

30 recognizing an epitope on T4 which does not interfere 
with HIV binding [J. A. Hoxie et al. r J . Immunol . , 
136, pp. 361-63 (1986)] or with 5 pg of 0KT4A (Ortho 
Diagnostics #7142), a monoclonal antibody that inter- 
feres with HIV binding to T4 positive cells [J. Steven 

35 McDougal et al . , J. Immunol. , 137 , pp. 2937-2944 

(1986)]. Alternatively, we mixed 50 pi of solubilized 
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T4 with 5 |jg of aHTLV III gpl20 (Dupont #NEN-9284). 
We then incubated the mixtures on ice for 1 hour. 

Subsequently, we added 150 pi of S- 
labelled gpl60/gpl20 cell extract or 35 S-labelled 
5 control cell extract (precleared with protein-A 

Sepharose) to the preincubated solubilized T4/mono- 
clonal antibody mixtures and rocked the tubes over- 
night at 4°C. We then precipitated the T4/gpl60/gpl20 
immune complexes by adding 30 pi of protein-A 

10 Sepharose to each tube and rocking for 2 hours at 

4°C to allow the protein-A Sepharose to bind to the 
antibody complexes. Subsequently, we spun down the 
beads in an Eppendorf microfuge and after extensive 
washings, we eluted with 40 pi SDS sample buffer at 

15 65 °C for 10 minutes. We then loaded 20 pi of the 

eluted material on a 7.5% SDS-PAGE gel which was run 
under reducing conditions . 

Figure 2 depicts auto radio graph and Western 
blot results of the T4/gpl60/gpl20 coiramunoprecipita- 

20 tions. In Figure 2, lanes 1-5 were autoradiographed 
after treatment with 40% sodium salicylate and lanes 
6-7 were developed on a Western blot with rabbit 
antisera JB-2 . 

As shown in Figure 2, gpl60/gpl20 protein 

25 was coimmunoprecipitated in the presence of T4 with 
OKT4 (lane 5) but not in the presence of T4 with 
0KT4A (lane 4). Lane 3 shows the positive control 
for gpl60/gpl20 using aHTLV III gpl20 monoclonal 
antibody. Neither negative control with 35 S-labelled 

30 control extract (lane 1) or protein-A Sepharose alone 
(lane 2) showed bands migrating in the position of 
gpl60/gpl20. Based upon the bands that developed 
on the Western blot, the amount of T4 precipitated 
with either 0KT4 (lane 6) or 0KT4A (lane 7) appeared 

35 to be similar. 

This demonstrates that purified, solubilized 
native T4, which is naturally membrane bound, can 
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still interact with the HIV glycoprotein in solution. 
Accordingly/ we believe that cell free soluble T4 is 
useful in preventing the binding interaction between 
HIV and the T4 receptor of T4 + lymphocytes. By com- 
5 peting with cell surface T4 for binding to the HIV 

envelope protein gp!20, soluble T4 is useful in block- 
ing HIV infection. 

Synthesis Of Oligonucleotide DNA Probes 

The nucleotide sequence and a deduced amino 
10 acid sequence for a cDNA that purportedly encodes 
the entire human T4 protein have been reported 
f Maddon et al . , (1985), supra]. The deduced primary 
structure of the T4 protein reveals that it can be 
divided into domains as demonstrated below: 

15 Amino Acid 

Structure/Proposed Location Coordinates 

Hydrophobic/Secretory signal -23 to -1 

Homology to V-Regions/ 

Extracellular +1 to +94 

20 Homology to J-Regions/ 

Extracellular +95 to +109 

Glycosylated Region/ 

Extracellular +110 to +374 

Hydrophobic/Transmembrane 
25 Sequence +375 to +395 

Very Hydrophilic/ 

Intracytoplasmic +396 to +435 

Based on the sequence for the above-listed 
domains, we chemically synthesized antisense 

30 oligonucleotide DNA probes using conventional phos- 
phoamide DNA synthesis techniques. See, e.g., 
Tetrahedron Letters , 22, pp. 1859-62 (1981). We 
synthesized the probes on an Applied Biosystems 380A 
DNA synthesizer and purified them by gel electro- 

35 phoresis. 
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Furthermore, we synthesized the probes 
such that they were complementary to the DNA 
sequences which code for the amino acid sequence, 
i.e., the probes were antisense, to enable them to 
5 recognize and hybridize to the corresponding sequences 
in DNA, as well as in mRNA. The nucleotide sequences 
of the eleven selected regions of the T4 protein 
[corresponding to the nucleotide numbering set forth 
in Maddon et al . , (1985), supra] were the following: 
L0 Nucleotide 
Oligonucleotide Coordinates 

1 145-171 

2 742-765 

3 1414-1440 
15 6 427-453 

7 1303-1329 

8 1012-1038 

9 97-118 
10 10-36 

20 ii 1698-1724 

12 397-423 

14 261-287 

Before using our DNA probes for screening, 
we 5» end-labelled each of the single -stranded DNA 
25 probes with 32 F using [ V - 32 P]-ATP and T4 polynucleo- 
tide kinase, substantially as described by A. M. Maxam 
and W. Gilbert, "A New Method For Sequencing DNA", 
Proc. Natl. Acad. Sci. USA , 74, pp. 560-64 (1977). 

Construction of \gtl0 Peripheral Blood 
30 Lymphocytes cDNA Library 

To prepare our Peripheral Blood Lymphocytes 
(PBL) cDNA library, we processed PBL, from a single 
leukophoresis donor, through one round of absorption 
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to remove monocytes. We then stimulated the non- 
adherent cells with IFN-y 1000 U/ml and 10 pg/ml PHA 
for 24 hours. We isolated RNA from these cells using 
phenol extraction [Maniatis et al.. Molecular Cloning , 
5 p. 187 (Cold Spring Harbor Laboratory) (1982)] and 
prepared poly A + mRNA by one round of oligo dT cellu- 
lose chromatography. We ethanol precipitated the 
RNA, dried it in a speed vac and resuspended the RNA 
in 10 pi H 2 0 (0.5 pg/pl). We treated the RNA for 10 
10 min at room temperature in CH^HgOH (5 mM final con- 
centration) and 0-mercaptoethanol (0.26 M). We then 
added the methyl mercury treated RNA to 0.1 M Tris-HCl 
(pH 8.3) at 43 °C, 0.01 M Mg, 0.01 M DTT, 2 mM Vanadyl 

complex, 5 pg oligo <^ 12 -18' 20 ^ KCl/ 1 ^ dCTP ' 

15 dGTP, dTTP, 0.5 mM dATP, 2 pCi [cr- 32 P ] dATP and 30 U 

1.5 m! AMV reverse transcriptase (SeikagaJcu America) 
in a total volume of 50 pi. We incubated the mixture 
for 3 minutes at room temperature and then for 3 hours 
at 44 °C, after which time we stopped the reaction by 

20 the addition of 2.5 pi of 0.5 M EDTA. 

We extracted the reaction mixture with an 
equal volume of phenol: chloroform (1:1) and precipi- 
tated the aqueous layer two times with 0.2 volume of 
10 M NH^AC and 2.5 volumes EtOH and dried it under 

25 vacuum. The yield of cDNA was 1.5 pg. 

We synthesized the second strand according 
to the methods of Okayama and Berg [ Mol. Cell. Biol. , 
2, p. 161 (1982)] and Gubler and Hoffman [ Gene , 25, 
pp. 263-69 (1983)], except that we used the DNA poly- 

30 merase I large fragment in the synthesis. 

We blunt ended the double- stranded cDNA by 
resuspending the DNA in 80 m1 TA buffer (0.033 M Tris 
Acetate (pH 7.8); 0.066 M KAcetate; 0.01 M MgAcetate; 
0.001M DTT; 50 pg/ml BSA) , 5 pg RNase A, 4 units RNase 

35 H, 50 pM 3 NAD , 8 units E.coli ligase, 0.3125 mM 
dATP, dCTP, dGTP, and dTTP, 12 units polymerase 
and incubated the reaction mixture for 90 min at 
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37 Q C, added 1/20 volume of 0 . 5M EDTA, and extracted 
with phenol: chloroform. We chromatographed the 
aqueous layer on a G150 Sephadex column in 0.01M 
Tris-HCl {pH 7.5), 0.1 M NaCl, 0.001 M EDTA and 
5 collected the lead peak containing the double-stranded 
cDNA and ethanol precipitated it. Yield: 0.605 
cDNA. 

We li gated the double- stranded cDNA to 

linker 3S/36: 

10 5 1 AATTCGAGCTCGAGCGCGGCCGC3 ' 

3 ' GCTCGAGCTCGCGCCGGCG5 1 
using standard procedures. We then size selected 
the cDNA for 800 bp and longer fragments on a S500 
Sephacryl column, and ligated it to EcoRI -digested 

15 bacteriophage lambda vector gtlO (a gift of 

Dr. Ellis Reinherz). We packaged aliguots of the 
ligation reaction in Gigapak (Strategene) according 
to the manufacturer's protocol. We used the packaged 
phage to infect E.coli BNN102 cells and plated the 

20 cells for amplification. ■ The resulting library con- 
tained 1.125 x 10 6 independent recombinants. 

We also screened a PBL cDKA library in the 
bacteriophage lambda vector gtlO (a gift of Dr. Ellis 
Reinherz)/ which was synthesized from mRNA from a 

25 T4 + tumor cell line named REX, which expresses T4 

protein at high levels [O. Acuto et al., "The Human 
T Cell Receptor: Appearance In Ontogeny And 
Biochemical Relationship Of Lambda and Beta Subunits 
on IL-2 Dependent Clones And T Cell Tumors", Cell, 

30 34, pp. 717-26 (1983)]. 

Screening Of The Libraries 

32 

We then used three of our F-labelled 
synthetic oligonucleotide antisense probes, probes 3, 
6 and 9, to screen in parallel our two A gtlO cDNA 
35 libraries using the plaque hybridization screening 

technique described in R. Cate et al., "Isolation Of 
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The Bovine And Human Genes For Mullerian Inhibiting 
Substance And Expression Of The Human Gene In Animal 
Cells", Cell , 45 , pp. 685-98 (1986), with minor 
modifications. We modified the Cate et al. proce- 
5 dure by hybridizing without tetramethyl ammonium 
chloride to accommodate our use of unique probes, 
rather than mixtures, to probe the plaque filters. 

We used the three probes, which had been 
previously 5' end-labelled with [y- 32 P]-ATP according 
10 to the method of A. Maxam and W. Gilbert, Meth. 

Enzymol . , 68, pp. 499-80 (1979) to screen in parallel 
the PBL cDNA library and the REX cDNA library dis- 
cussed above. 

From our screening of the PBL library, we 
15 isolated a nearly full length soluble T4 cDNA clone — 
\203-4 (or AgtlO.PBL.T4) — containing a 3.064 kb 
insert which could be cleaved from the AgtlO vector 
with EcoR I . 

From our screening of the REX cell library, 
20 we isolated an incomplete T4 cDNA clone containing 

a 1,200 bp cDNA insert. We then further characterized 
the DNA from these clones by DNA sequencing analysis. 

We also screened a bacteriophage lambda 
human genomic library, constructed in the vector 
25 EMBL3 by Dr. Mark Pasek (Biogen Inc., Cambridge, 

Massachusetts) [N. Murray in Lambda 2, eds. R. Hendrix, 
J. Roberts, F. Stahl, R. Weisberg, pp. 3935-422 (1983)]. 
The library contains DNA fragments, created by partial 
restriction of chromosomal DNA from the human lympho- 
30 blastid cell line GM1416,48 f XXXX (Human Genetic 
Mutant Cell Repository, Camden, New Jersey) with 
Sau3a, ligated onto EMBL3 arms which had been sub- 
jected to cleavage with BamHI according to the pro- 
cedures outlined in Maniatis et al. , (1982), supra. 
35 Plating of the phage library, lysis, and transfer of 
the phage DNA onto nitrocellulose were performed as 
described by W. D. Benton and R. W. David, "Screening 
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of Lambda gt Recombinant Clones By Hybridization To 
Single Plagues In Situ", Science , 196, p. 180 (1977) 
and M^iatis et al . (1982). Hybridization conditions 
were those described by Gate et al. (1986), supra, 
5 except that tetramethylammonium chloride (TMACl) was 
omitted from the washing buffer. 

Approximately 2 million plaques were 
screened in parallel hybridizations with probe 1 
and probe 3 discrussed above. One phage, called 

10 CM47, which hybridized with probe 3 in the primary 
screenings, was subjected to DNA sequence analysis 
to determine the existence and position of an intron 
between the coding sequences for the predicted extra- 
cellular and transmembrane domains. No phage clones 

15 containing T4 sequences were found screening with 
probe 1, probably because it includes a sequence 
interrupted by an intron [D. R. Littman and s. N. 
Gettner, Nature , 325, pp. 453-55 (1987); and our 
observations] . 

20 Partial sequence analysis of CM47 shows 

that an intron interrupts the sequence corresponding 
to the codon for valine (amino acid 363) of the 
deduced primary sequence for T4 (Figure 3 — in which 
introns are indicated by a solid line). This intron 

25 defines a potential site for introducing a stop codon 
in order to express a soluble form of T4. Another 
intron found within the coding sequence for T4 inter- 
rupts the codon for arginine (amino acid 295) and a 
third intron in CM47 is found between the codons for 

30 arginine (amino acid 402) and arginine (amino acid 
403) (Figure 3) . 

Sequencing Of cDNA Clones 

We then subcloned EcoRI digested DNA from 
clone A.203-4 into animal expression vector pBG312 
35 p R. Cate et al . , supra] to facilitate sequence 

analysis. More specifically, as depicted in Figure 4, 
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we then digested \gtlO .PBL.T4 with Eco RI to excise 
the 3.064 kbp EcoRI-EcoRI fragment containing the 
full length T4 cDNA. This cDNA sequence, including 
the entire coding region for soluble T4 and for full 
length T4 was deposited in p!70-2 . We used T4 ligase 
to ligate the fragment into animal expression vector 
pBG312 [supra] which had been previously cut with 
EcoR I , to form pBG312.T4 and pl70-2 (Figure 4). We 
then determined the nucleotide sequence of the Eco RI 
fragment of pBG312.T4 using Max am Gilbert technology 
[A. M. Maxam and W. Gilbert, "A New Method For 
Sequencing DNA" , Proc. Natl. Acad. Sci. USA , 74, 
pp. 560-64 (1977)] (see Figure 3, which depicts the 
PBL cDNA sequence in comparison to that reported by 
Maddon et al . , (1985), supra). This analysis showed 
that the 3.064 kbp PBL full length complementary DNA 
copy of T4 cDNA contained the coding sequence for 
T4, approximately 200 bp of 5' noncoding sequence 
and approximately 1500 bp of 3* noncoding sequence. 

We then cut pBG312.T4 with Pst I and removed 
the resulting 3 1 protruding ends with Klenow and 
isolated an approximately 2.5 kbp fragment. We then 
inserted the fragment into the poly linker of pBG312 
(which had been previously restricted at the Sma l 
site) to form plasmid pl70-2, which contains the 
full length PBL T4 cDNA sequence (see Figure 3). 

As depicted in Figure 3, the PBL T4 cDNA 
contains a nucleotide sequence almost identical to 
the approximately 1,700 bp sequence reported by 
Maddon et al . , (1985), supra. The PBL T4 cDNA, how- 
ever, contains three nucleotide substitutions that, 
in the translation product of this cDNA, would pro- 
duce a protein containing three amino acid substi- 
tutions compared to the sequence reported by Maddon 
et al. As shown in Figure 3, these differences are 
at amino acid position 3, where the asparagine of 
Maddon et al. is replaced with lysine; position 64, 
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where the tryptophan of Maddon et al. is replaced 
with arginine and at position 231, where the phenyl- 
alanine of Maddon et al. is replaced with serine. 
The asparagine reported at position 3 of Maddon 
5 et al. instead of lysine was the result of a sequenc- 
ing error (Dr. Richard Axel, personal communication). 
The significance of the a m ino acid replacements at 
positions 64 and 231, which may represent allellic 
polymorphism [T. C. Fuller et al., Human Immunology , 
10 9, pp. 89-102 (1982); W. Stohl and H. G. Kunkel, 

Scand. J. Immunol. , 20, pp. 273-78 (1984); N. Amino 
et al., Lancet , 2, pp. 94-95 (1984); and M. Sato 
et al., J. Immunol. , 132, pp. 1071-73 (1984)], is 
not known. 

15 DNA sequence analysis [ Maxam and Gilbert , 

supra] of the insert in pEClOO of the REX clone sug- 
gests that it represents the product of a splicing 
error, because 5' noncoding sequence appears to have 
been spliced with coding sequence beginning with the 

20 GGT codon for glycine (amino acid 49) (see Figure 3 
and Figure 5). The T4 coding sequence in pEClOO* 
from glycine (amino acid 49) to isoleucine (amino 
acid 435) is identical to the sequence of Maddon 
et al . , (1985), supra. 

25 In comparison, our earlier N- terminal pro- 

tein sequence analysis of native T4 protein purified 
from U937 cells shows a T4 expression product with 
aspaxgine as amino acid 3. These differences are 
also set forth in Figure 6, which also depicts com- 

30 parisons at corresponding positions of the partial 
clone from the REX cell line XgtlO library; our 



* we constructed pEClOO by digesting the incomplete 
T4 cDNA clone from the REX library with EcoRI and 
3 5 isolating the 1,200 bp cDNA insert. We then ligated 
it to pUC12 (Boehringer Mannheim, Indianapolis, 
Indiana) which had been previously cut with EcoRI to 
form pEClOO. 
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genomic clone from a \EMBL3 library; mouse T4 
sequences [Tourvieille et al. f Science , 234, p. 610 
(1986)] and sheep T4 sequences [Classon et al., 
Immunogenetics , 23, p. 129 -(1986)]. 

5 Construction of Soluble T4 Mutants 

We then employed the technique of in vitro * 
site-directed mutagenesis and restriction fragment 
substitution to modify the T4 cDNA coding sequence 
of pl70-2 in sequential steps to be identical to 

10 that reported by Maddon et al . , (1985), supra. We 
first used oligonucleoti de-directed mutagenesis to 
modify the amino acids at positions 3 and 64. Next, 
we employed restriction fragment substitution with a 
fragment including the serine 231 codon of a partial 

15 T4 cDNA isolated from a T4 positive lymphocyte cell 
line [O. Acuto et al., Cell , 34, pp. 717-26 (1983)] 
library in Agtll (a gift from Dr. Ellis Reinherz), 
to modify the amino acid at position 231. We then 
truncated our modified T4 cDNA sequence to remove 

20 the coding regions for the transmembrane and intra- 
cytoplasmic domains. Subsequently, we constructed 
three different soluble T4 mutants from our full 
length T4 clone PBL T4 by linker insertion between 
restriction sites in order to increase the probability 

25 of empirically finding a stable, secretable T4 mole- 
cule. The structure of each of these mutants is 
depicted in Figure 7A. 

Line A of Figure 7A represents a hydropathy 
analysis of our full length soluble T4 carried out 

« 

30 using a computer program called Pepplot (University 
of Wisconsin Genetics Computer Group) according to 
J. Kyte and R. F. Doolittle, J. Mol. Biol. , 157, 
pp. 105-32 (1982). Line B depicts the protein domain 
structure of full length T4 r Maddon et al. , (1985) 

35 supra] in which "S" represents the secretory signal 
sequence, "V" represents the immunoglobul in-like 
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variable region sequence, "J" represents the immuno- 
globulin-like joining region sequence, U U" represents 
the unique, extracellular region sequence, "TM" 
represents the transmembrane sequence and "C" repre- 
5 sents the cytoplasmic region sequence. In line B, 

the transmembrane amino acid sequence and some flank- 
ing sequence is written below the TM domain. Line C 
depicts the protein domain structure of recombinant 
soluble T4 mutants rsT4.1 in pBG377, rsT4.2 in pBG380 

10 and rsT4.3 in pBG381. Line D represents the protein 
domain structure of E.coli rsT4 gene (Met-perfect 
construct) (pl99-7) which is deleted for the T4 
N- terminal signal sequence (S). 

We constructed the first three soluble T4 

15 mutant gene fragments by truncating our full length 
soluble T4 cDNA at positions corresponding to either 
intron/exon boundaries or to protein domain boundaries 
defined by hydropathy analysis predictions. More 
specifically, we introduced synthetic linkers into 

20 the unique Aval site that is 5' to the transmembrane/ 
extracellular domain boundary to produce an in- frame 
trans lational stop codon, thus constructing T4 genes 
that lack the transmembrane and cytoplasmic domains 
of the full length T4 sequence. 

25 For example, mutant rsT4.1 in pBG377 was 

truncated by the insertion of a stop codon following 
amino acid 362, lysine, which corresponds to the 
position of an intron separating the extracellular 
and transmembrane domain exons. The positions both 

30 of this intron and of the adjacent intron that splits 
the transmembrane and cytoplasmic domains were deter- 
mined by DNA sequence analysis of chromosomal T4 
clones isolated from the \EMBL3 genomic library 
described above. Although the significance of the 

35 intron positions flanking the T4 transmembrane domain 
is not known, the determination of the genetic struc- 
ture could provide important information for design- 
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ing rsT4 mutants, since exons frequently define 
functional domains [W. Gilbert, "Why Genes In Pieces?", 
Nature, 271, p. 501 (1978)]. 

We then constructed mutant rsT4.2 in pBG380 
5 by truncating the T4 cDNA at the boundary of the 
transmembrane and extracellular domains at amino 
acid 374. And, we constructed mutant rsT4.3 in 
pBG381 by truncating the T4 cDNA at amino acid 377, 
three amino acids downstream from the transmembrane/ 

10 extracellular domain boundary and within the trans- 
membrane domain. 

We also employed the technique of oligo- 
nucleotide site directed mutagenesis, according to 
D. Strauss et al . , "Active Site Of Triosephosphate 

15 Isomerase: In Vitro Mutagenesis And Characterization 
Of An Altered Enzyme", Proc. Natl. Acad. Sci. USA , 
82, pp. 2272-76 (1985), to construct a fourth soluble 
T4 mutant from our full length T4 clone PBL T4. The 
structure of this mutant is depicted in Figure 7A, 

20 line D, which represents the protein domain structure 
of E.coli rsT4 gene (Met-perfect rsT4.2) construct, 
deposited in pl99-7, which is deleted for the T4 
N-terminal signal sequence (S). 

We also constructed various other soluble 

25 T4 deletion mutants to determine which smaller 

fragments of the T4 sequence provide a protein which 
binds to HIV. These constructions were based on our 
belief that only the amino terminal sequence of T4 
is required for binding to HIV. This belief, in turn, 

30 was based upon observations that the monoclonal anti- 
body 0KT4A blocks infection of T4 positive cells by 
HIV and it appears to recognize an epitope in the 
amino portion of T4 [ Fuller et al. , supra] . Such 
fragments of T4, which lack glycosylation and which 

35 are capable of binding HIV and blocking infection, 

may be produced in E.coli or chemically synthesized. 
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The structure of each of these deletion 
mutants is depicted in Figure 7B. In that figure, 
line A depicts the protein domain structure of full 
length T4 f Maddon et al. , (1985), supra; Figure 7A] . 
5 in line B, the protein structure of recombinant 

soluble T4 mutants are depicted as follows: rsT4.7 
in p203-5, rsT4.7 in pBG392, rsT4.8 in pBG393, rsT4.9 
in pBG394, rsT4.10 in pBG395, rsT4.11 in pBG397, 
rsT4.12 in pBG396, rsT4.111 in pBG215-7, rsT4. 113.1 

10 in pBG211-ll and rsT4.113.2 in pBG214-10. 

We constructed soluble T4 derivatives 
p203-5, pBG392, pBG393, pBG394 and pBG396 by trun- 
cating our rsT4.2 gene after the Stu I sites at amino 
acids 183 and 264 of rsT4.2. More specifically, we 

15 constructed derivative rsT4.7 in p203-5 and in pBG392 
by truncating the rsT4.2 cDNA at amino acid 182. 
And, we constructed each of derivatives rsT4.9 in 
pBG394 and rsT4.12 in pBG396 by truncating the rsT4.2 
cDNA at amino acids 113, and 166, respectively. One 

20 may also construct each of derivatives rsT4.10 in 
pBG395 and rsT4.11 in pBG397 by truncating the 
rsT4.2 cDNA at amino acids 131 and 145, respectively. 

Expression of T4 and Soluble T4 
Polypeptides In Bacterial. Cells 

25 The cDNA sequences of this invention can 

be used to transform eukaryotic and prokaryotic host 
cells by techniques well known in the art to produce 
recombinant soluble T4 polypeptides in clinically 
and commercially useful amounts. 

30 For example, we constructed expression 

vector pl99-7, as shown in Figure 9A, as follows. 

We preceded the construction depicted in 
Figure 9A by the construction of various intermediate 
plasmids, as depicted in Figures 8A-8D. Those con- 

35 structions were carried out using conventional 
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recombinant techniques. The linkers employed in 
those constructions are set forth in Figure 10. 

As depicted in Figures 8A and 8B, starting 
with pl70-2, which contains our full length T4 DNA 
5 sequence, coding for T4 characterized by three dif- 
ferent amino acids than that of Maddon et al . , (1985), 
supra, we produced various constructs which direct 
the expression of soluble T4. Some of these con- 
structs are characterized in that one or more of 
10 those amino acid differences have been changed to 
correspond to the respective amino acids of Maddon 
et al. In this figure, as well as in the other 
figures, amino acid changes are reflected by an 
arrow. 

15 Plasmid pl92-6 contains the Met perfect 

rsT4.2 sequence derived by oligonucleotide site- 
directed mutagenesis which removed the entire T4 
N-terminal signal sequence as shown in Figure 8C. 
And, to provide a convenient means of transferring 

20 the rsT4.2 Met perfect sequence into E.coli expression 
vectors, the steps described in Figure 8D were carried 
out to produce pl95-8, a plasmid containing the Met 
perfect rsT4.2 sequence flanked by Cla l restriction 
sites. The Cla l - Cla l cassette of pl95-8 optimizes 

25 the distance between the 5' Cla l site and the 
initiating Met codon. In Figure 8D f ST8 rop" 
is a tetracycline resistance encoding pAT153- 
based plasmid containing the rop~ mutation that 
permits high plasmid copy number, a promoter and 

30 ribosome binding site from bacteriophage gene 32 and 
the gene 32 transcription termination sequence. 

Cleavage of pl95-8 with Cla l produced the 
fragment used to assemble pl99-7, a construction 
which directs the expression of Met perfect rsT4.2 

35 under the control of the P_ promoter (Figure 9A) . 

Li 

As the first step, to construct a vector from which 
rsT4.2 expression is under control of the P.. promoter, 
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we constructed the vector pl97-12 from pl034 
(plmuGCSF) (Figure 9A). 

We then cut pl034 with EcoRI and BamHI to 
excise the GCSF cDNA insert and a portion of the 
5 phage mu ribosome binding site sequence — which we 
subsequently reconstructed with oligonucleotides. 
The synthetic linkers used were linkers 57-60 
(Figure 10) . 

We then ligated the synthetic linker into 

10 the EcoRI/BamHI-cut pl034 to form pl97-12. One could, 
instead, replace these steps by starting with any 
suitable E.coli expression vector containing a Clal 
site appropriately placed between the promoter and 
terminator sequences. We cut pl97-12 with Clal and 

15 inserted a Clal -Clal cassette containing the cDNA 

sequence of rsT4.3 in pBG381 and phage transcription 
terminator derived from p!034. The sequence of this 
cassette is depicted in Figure 11. The resulting 
plasmid, pl99-7, contains the rsT4.2 "Met perfect" 

20 gene in that vector. 

Alternatively, one could derive the Met 
perfect rsT4.2 sequence from plasmid pBG380, 
deposited in connection with this application, and 
gap out the signal sequence to create pl92-6. 

25 We tested for expression of pl99-7 as 

follows. SG936, an E.coli Ion htpr double mutant 
[ATCC 39624] [S. Goff and A. Goldberg, "ATP -Dependent 
Protein Degradation In E.coli ", in Maximizing Gene 
Expression , W. Reznikoff and L. Gold (eds.) (1986)], 

30 was transformed with pl99-7 by conventional proce- 
dures r Maniatis et al. (1982)] to form SG936/pl99-7 , 
a trans form ant containing a plasmid with the Met- 
perfect rsT4.2 gene behind the P L promoter- Trans- 
formants were selected on LB agar plates containing 

35 10 racg/ml tetracycline (tet). After streaking out 

several single colonies for single colony isolation, 
one was chosen at random for testing induction of 
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rsT4.2 synthesis. We picked a single colony from an 
LB- agar tet + plate into 20 ml Luria Broth (LB) and 
10 mcg/ml tet in a 125 ml shake flask and grew it 
overnight in a shaking air incubator (New Brunswick 
5 Scientific, New Jersey) at 30 °C. 

We then initiated an induction culture by 
adding 0.5 ml of the overnight culture to 50 ml LB 
and tet in a 500 ml flask which was grown at 30 °C in 
a shaking air incubator. When the culture reached 

10 an OD(600) of 0.4, we transferred it to a 42 °C water- 
bath and shook it gently for approximately 20 minutes. 
After heat induction at 42°C, the flask was trans- 
ferred to a 39 °C air incubator (New Brunswick 
Scientific, New Jersey) where it was shaken vigorously 

15 at 250 rpm. We withdrew samples just after the 42 °C 
heat shock, and at hourly time points for 4 hours, 
and then after overnight growth. The samples were 
measured for growth by OD(600) and analyzed following 
SDS-PAGE for the pattern of protein synthesis by 

20 Coomassie blue protein staining and by Western blot 
analysis with our rabbit antipeptide antibody probes 
(described above). Based on the relative molecular 
weight and protein blot analysis, the expression of 
rsT4.2 was induced from SG936/pl99-7 following heat 

25 induction at 42°C (Figure 12). 

We transformed pl99-7 into a P L mu.tet 
expression vector, an E . coli expression vector, at 
the unique Clal site (see Figure 11). The nucleo- 
tide and amino acid sequences of pl99-7 are shown in 

30 Figure 11. 

The expression of soluble T4 from pi 99-7 
in E.coli was measured by Western blot analysis of 
whole cell extracts following SDS-FAGE using the 
rabbit polyclonal anti- pep tide JB-1 or anti -peptide 
35 JB-2 antibodies as probes (Figure 12). 

We also constructed expression vector 
p203-5, as shown in Figure 9B, as follows. 
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We started with pl97-7, which has the same 
sequence as the P L um vector pl97-12 (see Figure 9A), 
except that there is a single nucleotide deletion in 
the 5' noncoding region following the F L promoter. 
5 That deletion, which is a deletion of nucleotide 

#40 — adenine — of pl97-12 (see Figure 11), resulted 
from a deletion in the region that was constructed 
from linkers 57-60 (see Figure 10). pl97-7 contains 
the rsT4.2 gene comprising 374 amino acids. Alter- 
10 natively, one could also use pl97-7 as a starting 
plasmid. 

We cut pl97-7 with Cla l . We also cut pl95-8 
(see Figures 8D and 9A) with Cla l to remove the 
Cla l - Clal cassette containing the cDNA sequence of 

15 rsT4.2. Subsequently, we inserted the Clal -Clal 
cassette into pl97-7 to produce pl98-2 . 

We then digested pl98-2 with Stu l to 
remove 80 amino acids (amino acid 185 to amino acid 
264) of the mature T4 protein coding sequence. Unex- 

20 pected methylation, however, prevented cutting at 

the second Stu l site, so that only the Stul site at 
amino acid 184 was cleaved. Following ligation, the 
plasmid DNA was transformed into E.coli and we 
examined several plasmid clones for the deletion 

25 using standard procedures. None of those plasmids 
contained the expected Stu l deletion. 

Subsequent DNA sequence analysis of one 
of these plasmids, called p203-5, showed that two 
guanine residues (see amino acids 183 and 184; 

30 nucleotides 818 and 819 of Figure 3) of the Stul 
recognition sequence had been deleted following 
cleavage due to exonuclease digestion caused by the 
use of exonuclease-contaminated Stu l enzyme. This 
dinucleotide deletion produced a translation frame- 

35 shift following amino acid 182 (glutamine) and intro- 
duced a stop codon six amino acid codons downstream 
from the frameshift (Figure 9C). The unexpected 
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methylation of the second StuI site together with 
the deletion that resulted in a new stop codon 
produced a gene encoding a shortened form of recom- 
binant soluble T4, called rsT4.7. The rsT4.7 sequence 
5 encodes a 182 amino acid N-terminal segment of the * 
mature T4 sequence followed by, at the C-terminus, 
six amino acids — asparagine-leucine-glutamine- ^ 
histidine-serine-leucine — of non-T4 sequence and 
finally by a TAA stop codon. 
10 The expression of soluble T4 from p203-5 

in E.coli was measured by Western blot analysis 
as previously described. 

Expression of T4 and Soluble T4 
Polypeptides In Animal Cells 

15 We inserted both soluble T4 genes and the 

unmodified gene encoding membrane-bound T4 into 
animal expression vector pBG368. More specifically, 
we inserted each of the soluble gene constructs into 
pBG368 under the transcriptional control of the 

20 adenovirus late promoter , to give plasmids pBG377, 
pBG380 and pBG381. We also made two pBG312-based 
constructions, called pBG378 and pBG379, which 
direct the expression of recombinant full length T4 
protein. pBG378 and pBG379 code for the same full 

25 length T4 protein but in pBG379, a portion of the 3' 

untranslated sequence has been removed. Subsequently, 
to test for expression of recombinant soluble T4 and 
recombinant full length T4, we cotransfected Chinese 
hamster ovary ("CHO") cells with one of each of 

30 those plasmids and with the plasmid pAdD26. s 
We first constructed pBG3 68 as follows. 
As depicted in Figure 13, we cut animal cell expres- * 
sion vector pBG312 [R. Cate et al., "Isolation Of 
The Bovine And Human Genes For Mullerian Inhibiting 

35 Substance And Expression Of The Human Gene In Animal 
Cells", Cell , 45, pp. 685-98 (1986)] with EcoRI and 
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, Bglll to delete one of each of the two EcoRI and the 
two Bglll restriction sites (the EcoRI site at posi- 
tion 0 and the B3III site located at approximately 
position 99). The resulting plasmid, pBG368, retained 
5 an EcoRI site in the cloning region and a BgJ.II site 
after the cloning region. This left a single EcoRI 
site and a single Bglll site in the polylinker for 
cloning purposes. 

More specifically, we deleted one Eco RI 

10 site and one Bglll site by sequential partial diges- 
tion of pBG312 with restriction enzymes EcoRI and 
Bglll, respectively. We filled in with Klenow and 4 
nucleotides then religated to produce pBG368, which 
contains unique restriction sites for Eco RI and Bglll 

15 enzymes. 

Once transient expression of soluble T4 
was verified, we constructed stable cell lines that 
continuously expressed soluble T4. To do this, we 
employed the stable cell expression host, the 

20 dihydrofolate "reductase deletion mutant (DHFR ) 
Chinese hamster ovary cell line [F. Kao et al., 
"Genetics Of Somatic Mammalian Cells X Complementation 
Analysis of Glycine-Requiring Mutants", Proc. Natl. 
Acad. Sci. , 64, pp. 1284-91 (1969); L . Chasin and 

25 G, Urlab "Isolation Of Chinese Hamster Cell Mutants 
Deficient In Dihydrofolate Reductase Activity", 
Proc. Natl. Acad. Sci. , 77, pp. 4216-80 (1980)]. 

Using this system, we cotransfected each 
T4 gene construct with pAdD26 [R. J. Kaufman and 

30 P. A. Sharp, "Amplification And Expression Of 

Sequences Cotransfected With a Modular Dihydrofolate 
Reductase Complementary DNA Gene 11 , J. Mol . Biol . , 
159, pp. 661-21 (1982) containing the mouse DHFR 
gene. Before carrying out the co-transfections , we 

35 linearized all plasmids by restriction enzyme cleavage 
and, prior to transfection, we mixed each plasmid 
with pAdD26 so that the molar ratio of pAdD26 to T4 
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was 1:10. This maximized the number of T4 gene copies 
per trans feet ant. 

Within the cell, the plasmids were ligated 
together to form polymers that can become integrated 

5 into host chromosomal sequences by illegitimate 

recombination [J. Haynes and C. Weissmann, "Constitu- 
tive , Long-Term Production Of Human Interferons By 
Hamster Cells Containing Multiple Copies Of a Cloned 
Interferon Gene", Nucl. Acids Res. , 11, pp. 687-706 

10 (1983); S. J. Scahill et al., "Expression And Charac- 
terization Of The Product Of A Human Immune Interferon 
cDNA Gene In Chinese Hamster Ovary Cells", Proc. Natl. 
Acad. Sci. USA , 80, pp. 4654-58 (1983)]. We selected 
trans fectants that express the mouse DHFR gene in 

15 culture medium lacking nucleotides. We then subjected 
these trans fectants to a series of increasing concen- 
trations of methotrexate, a toxic folate analogue 
that binds DHFR, to select for cells levels of DHFR. 

Resistance to methotrexate by increased 

20 expression of DHFR is frequently the result of DHFR 

gene amplification, which can include the reiteration 
of large chromosomal segments, called amplified 
units [R. J. Kaufman and P. A. Sharp, "Amplification 
And Expression Of Loss Of Dihydrofolate Reductase 

25 Genes In A Chinese Hamster Ovary Cell Line", Molec. 
Cell. Biol. , 1, pp. 1069-76 (1981)]. Therefore, 
cointegration of DHFR and rsT4 sequences permitted 
the amplification of rsT4 genes. Stably transfected 
cell lines were isolated by cloning in selective 

30 growth medium, then screened for T4 expression with 

a T4 antigen (RIA) [D. Klatzmann et al., Nature , 

312, pp. 767-68 (1984)] and by immunoprecipitation 

35 

from conditioned medium after [ S] cysteine 
(" 35 S-Cys 11 ) metabolic labelling. 
35 we also inserted the soluble T4 derivative 

rsT4.7 gene into an animal cell expression plasmid 
as follows . 
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As set forth in Figure 14C, we cut plasmid 
pBG381 (Figure 14A) with Eco RI and Nhe l . We then cut 
pl86-6 with EcoRI and Nhel to remove the 786 base 
pair fragment. We ligated that fragment into the 
5 digested pBG381 to form plasmid pBG391. The T4 
sequence in pBG391 is identical to both that of 
Maddon et al. (1985) supra at positions 64 (tryptophan) 
and 231 (phenylalanine) and to that of pBG381. How- 
ever, at position 3, the asparagine reported by 

10 Maddon et al. and present in pBG381 is replaced with 

lysine. The nucleotide sequence of pBG391 is depicted 
in Figure 15. 

We then digested p203-5 with Nhe l and 
Oxanl to remove the 483 base pair fragment. We 

15 inserted that fragment into Nhel / Ox an I -digested 
pBG391 to form plasmid pBG392, the animal cell 
expression construct of rsT4.7. The T4 sequence in 
rsT4.7 contains amino acids identical to that of 
Maddon et al . 1 s full length sequence at amino acid 

20 positions 64 (tryptophan) and 231 (phenylalanine). 
However, at position 3, the asparagine reported by 
Maddon et al . is replaced with lysine. The nucleo- 
tide sequence of pBG392 is depicted in Figure 16. 

In Figure 14D, we have depicted the con- 

25 struction of other animal cell expression constructs 
containing sequences encoding the deletions rsT4.9 
in pBG394, and rsT4.12 in pBG396. Those constructions 
were carried out using conventional recombinant tech- 
niques. The linkers employed in those constructions 

30 are set forth in Figure 18. The nucleotide sequences 
of pBG394 and pBG396 are shown in Figures 19 and 20. 

Plasmid pBG393, shown in Figure 17, con- 
tains rsT4.8, the perfect form of rsT4.7. pBG393 
contains 182 amino acids of the mature T4 sequence, 

35 without the additional non-T4 6 amino acids at the 

C-terminus following amino acid 182. The nucleotide 
sequence of BG393 is shown in Figure 21. 
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Other animal cell expression plasmids 
according to this invention may be constructed as 
depicted in Figure 17. These include rsT4.10 in 
pBG3 95 and rsT4.11 in pBG397 (see Figure 18 for 
5 specific linkers). 

The nucleotide sequence of BG395 is shown 
in Figure 22. 

Purification Of Recombinant Soluble T4 

Recombinant soluble T4 construct pBG380 

10 expressed in DHFR~ CHO cells was grown to confluency 
in a or-Modified Eagles Medium (Gibco) supplemented 
with 10% fetal calf serum, 1 mM glutamine and the 
antibiotics penicillin and streptomycin (100 [ig/ml 
of each). The cells were grown at 37°C in two 21 Cell 

15 Factory Systems (Nunc). We then washed the confluent 
cells free of fetal calf serum with a -Modified Eagles 
Medium without fetal calf serum and cultured the 
cells in a-Modified Eagles Medium at 37°C for 4 days. 
Subsequently, we harvested the conditioned media, . 

20 filtered it through a Millipore Minidisk 0.22m 

hydrophilic filter cartridge (Millipore #MCGL 305-01) 
and concentrated the secreted proteins on a fast-S 
ion exchange column (S-Sepharose Fast Flow, Pharmacia 
#17-0511-01) in 20 mM MES buffer (pH 5.5). 

25 We then eluted the bound proteins with 20 mM 

Tris-HCl (pH 7.7) and 0.3 M NaCl. The elution pool 
was subsequently diluted with 2 volumes of 20 mM 
Tris-HCl (pH 7.7) and it was then loaded on a column 
comprising immobilized 19Thy anti-T4 monoclonal anti- 

30 body coupled to Affigel-10 [a gift of Dr. Ellis 
Reinherz, Dana Farber Cancer Institute, Boston, 
Massachusetts] . We washed the column extensively 
and eluted the bound material as 0.5 ml fractions 
with 50 mM glycine-HCl (pH 2.5), 150 mM NaCl, 0 . 1 mM 

35 EGTA and 5 jjg/ml bovine pancreatic trypsin inhibitor, 
Aprotinin (Sigma #A1153 ) . We used Western blots 
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developed with rabbit antisera raised against peptide 
JB-2 to follow the purification. We employed silver 
stained gels to follow binding and elution of rsT4.2 
during the chromatography. Figure 23 depicts a 
5 Coomassie stained gel of purified rsT4.2. 

Gel sizing-column chromatography analysis 
of the purified rsT4.2 from the pBG380 transfected 
CHO cell line, BG380G, suggests that rsT4 is raonmeric 
under physiologic pH and salt concentration. 

10 Sequencing Of Recombinant 
Soluble T4 Protein 

We then determined the N-terminal amino 
acid sequence of a recombinant soluble T4, specifi- 
cally rsT4.2, molecule purified from the conditioned 

15 medium of the pBG380 transfected CHO cell line BG8QG, 
as described above, by automated Edman degradation 
in an Applied Biosystems 470A gas phase sequenator 
[R. B. Pepinsky et al., J. Biol Chem. , 261, 
pp. 4239-46 (1986)]. 

20 The amino terminal sequence matched the 

sequence which we had previously determined for 
solubili2ed native T4 isolated from U937 cells, supra . 
The amino terminal sequences of native solubilized 
T4 (sT4) and purified rsT4 protein are A2 proteins, 

25 as compared to the amino terminal sequence predicted 

by Maddon et al., (1985), supra, with the mature amino 
terminus located at position 3 of that sequence. The 
amino terminal sequences of solubilized native T4 
(sT4), recombinant soluble T4 (rsT4.2) secreted by 

30 CHO trans fee tan t BG380G containing pBG380 and the 
protein sequence deduced by Maddon et al. (1985), 
supra are as follows: 



sT4: 
rsT4.2: 



X-K-V-V-L-X-K-K-X-D-T-V-E-L-T-X-T-A-S-E 
N-K-V-V-L-G-K-K-G-D-T-V-E-L-T-X-T-A-S-E 
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Maddon 

etf a X. q-g-N-K-V-V-L-G-K-K-G-D-T-V-E-L-T-C-T-A-S-E 



In the above sequences, the amino acids 
are represented by single letter codes as follows: 



Phe: 


F 


Leu: 


L 


lie: 


I 


Met: 


M 


Val: 


V 


Sen 


S 


Pro: 


P 


Thr: 


T 


Ala: 


A 


Tyr: 


Y 


His : 


H 


Gin: 


Q 


Asn: 


N 


Lys: 


K 


Asp: 


D 


Glu: 


E 


Cys : 


C 


Trp: 


W 


Arg: 


R 


Gly: 


G 



10 X: not determined or ambiguous. 

We also constructed pBG211-ll, a plasmid 
coding for the N-terminal 113 amino acids of soluble 
T4 protein. This construct, which codes for a pro- 
tein characterized by a single disulfide bridge, 
15 between the cysteines at amino acid positions 18 and 
86, is conveniently expressed in E.coli . 

To construct p211-ll, as depicted in Fig- 
ure 24, we first cut pl95-8 (see Figures 8D and 9A) 
with Clal to remove the Cla l- Cla l cassette contain- 
20 ing the cDNA sequence of rsT4.2. We then digested 
pAT153Y3SH16AAmp, the tryptophan operon promoter 
plasmid from the gamma interferon producing E.coli 
strain BN374 with Cla l, and deleted the cDNA coding 
for gamma interferon. Subsequently, we inserted 
25 the Clal -Clal cassette into the Cla l -cut E.coli 

plasmid in front of the tryptophan operon promoter 
and ligated to produce pl96-10. 

As shown in Figure 25, we then subjected 
pBG380 to oligonucleoti de-directed mutagenesis to 
30 insert three tandem trans lational stop codons follow- 
ing the T4 cDNA sequence coding for amino acids -23 
to 113 in pBG380, lo produce pBG394. 

We then constructed p211-ll from fragments 
of each of pl96-10, pBG394 and pl034 as depicted in 
35 Figure 26. The first fragment including the vector 
sequences, was produced by restricting pl96-10 with 
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Hindlll and Clal to remove the T4 coding sequence 
from amino acids 61 through 374 of rsT4.2 and includ- 
ing vector sequence following the 3' end of the rsT4 
gene. The second fragment, a Hind i I I - Bglll segment 
5 including the codons for T4 amino acids 61-113 of 
rsT4.9 immediately followed by a triplet of stop 
codons in tandem, was isolated by Hind i I I/Bgl I I diges- 
tion of pBG394. The third fragment, a BamHI - Clal 
fragment containing a bacteriophage T4 transcriptional 

10 termination signal [H. N . Kirsch and B. Allet, 

"Nucleotide Sequences Involved In Bacteriophage T4 
Gene 32 Translational Self -Regulation" , Proc. Natl. 
Acad. Sci. USA , 79, pp. 4937-41 (1982)], was isolated 
by BamHI/Clal digestion of pl034. We then ligated 

15 these three fragments to produce p211-ll, a T4 con- 
struct coding for a 113 amino acid soluble form of 
T4 protein, with asparagine at amino acid position 3 

(i.e., rsT4. 113.1). 

We then subjected p211-ll to oligonucleo- 
20 tide site-directed mutagenesis (Figure 27) to change 
the amino acid at position 3 from asparagine to 
lysine using the oligonucleotide T4-66: 

* j 

;t AAA 

I I 

AAA GTA GTA CTG 
GGC 3' . 

This produced plasmid p214-10, a fully 
corrected 113 amino acid soluble T4 vector coding 
for a 113 amino acid soluble form of T4 protein, 
with lysine at amino acid position 3 (i.e., 
rsT4.113.2). As shown in Figure 27, we subjected 
p214-10 to oligonucleotide site-directed mutagene- 
sis to delete glutamine and glycine at, respectively, 



25 



30 



ATG CAG GGT 
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amino acid positions 1 and 2 of the T4 sequence using 
the oligonucleotide T4AID-87: 



This produced p215-7, a 111 amino acid 
soluble T4 construct, including the trp promoter, 
which directs the expression of a 111 amino acid 
10 soluble form of T4 protein, with lysine at amino 
acid position 3 (i.e., rsT4.111). 

We next constructed p218-8, a 111 amino 
acid construct which directs the expression of a 111 
amino acid soluble form of T4 protein, with lysine 
15 at amino acid position 3 (i.e., rsT4.111) under the 

control of the P r promoter, as depicted in Figure 28. 

More specifically, we cut pl97-12 (-Figure 
9A) with Cla l to remove the 101 bp fragment contain- 
ing linker and terminator sequences. We also cut 
20 p215-7 with Cla l to remove the Cla l - Cla l cassette 
containing the cDNA sequence of rsT4.111 and the <(>T4 
transcriptional terminator sequence f Kirsch and Allet , 
supra] . Subsequently, we inserted the Cla l - Cla l 
cassette into the Cla l -cut pl97-12 to produce p218-8. 
25 In order to express rsT4. 113.1, we trans- 

formed E.coli A89 with p211-ll by conventional 
techniques [Maniatis et al . (1982), supra] to form 
E.coli A89/p211-ll. E.coli A89 is a tetracycline 
sensitive derivative of E.coli SG936. We isolated 
30 E.coli AS 9 from E.coli SG936 according to the method 
of S. R. Maloy and W. D. Nunn, "Selection For Loss 
Of Tetracycline Resistance By Escherichia coli " , 
J. Bact. , 145, pp. 110-12 (1981), which is based 
upon the ability of the lipophilic chelating agent 
35 fusaric acid to selectively inhibit resistant strains. 



C 



5 



5' GTA TCG ATT TGG 
ATG ATG AAA AAA 
GTA GTA 3 ' . 
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More specifically, we plated E.coli SG936 on medium 
containing, per liter, 5 g tryptone, 5 g yeast extract, 
10 g NaCl, 10 g NaH 2 P0 4 -H 2 0, 50 mg chlortetracycline- 
HC1, 12 mg fusaric acid, 0.1 mM ZnCl 2 and 15 g agar. 
5 Colonies which grew at 30 °C (putative tetracycline- 
sensitive strains) were retested for tetracycline 
sensitivity on L-agar plates containing 5 ^g/ml 
tetracycline. One tetracycline- sensitive strain, 
designated A89, was then shown to be unable to grow 

10 on LB agar at 42 °C, thus verifying the presence of 
the htpR mutation. 

Trans formants were selected by tetracycline 
resistance. We picked a single colony into 20 ml of 
minimal medium plus 0.2% casamino acids plus trypto- 

15 phan (100 ug/ml) plus tetracycline (10 ug/ml) in a 

100 ml shake flask placed in a shaking air incubator 
at 30°C and allowed the cells to grow up overnight. 
The following morning, we inoculated 40 ml of mini- 
mal medium plus 0.2% casamino acids plus tryptophan 

20 (100 ug/ml) plus tetracycline (10 ug/ml) with the 

overnight culture at OD g00 = 0.05 in a 500 ml flask. 
The cells were grown to midlog phase and then induced 
by pelleting, washing once in minimal medium and 
then resuspending in minimal medium plus 0.2% cas- 

25 amino acids plus tetracycline (10 ug/ml), in the 
absence of tryptophan. We removed 0.6 OD &00 of 
cells after 0, 1, 2, 3 and 4 hours incubation and 
after growth overnight. 

The aliquots were centrifuged and cell 

30 pellets were subjected to lysis by boiling in 

Laemmli gel loading buffer. After centrifugation to 
remove cell debris, half of each sample was subjected 
to SDS-PAGE, followed by Western blot analysis with 
our rabbit antipeptide antibody probes or by Coomassie 

35 blue protein staining (Figures 29A and 29B). 
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Purification Of rsT4. 113.1 

We then purified rsT4. 113.1 from the E.coli 
trans form ant by means of two essentially quantitative 
steps involving ani on-exchange and gel- filtration 
5 chromatographies performed under reducing and dena- 
turing conditions. 

More specifically, we suspended 14 g of 
wet cells from a 4 L shake-flask fermentation in 
100 ml of a 20mM Tris (pH 7.5) buffer containing 
10 20 vg/ml DNase, 20 vq/ml RNase and 1 mM phenylmethyl- 
sulfonyl fluoride ("PMSF"). The suspension was applied 
to a French Press at 1000 psi in two passages and 
then centrifuged in an SA 600 rotor at 18,000 g for 
15 min at 4°C. The resulting pellet was solubilized 
15 in 20 ml of a 20 mM Tris (pH 7.5) buffer containing 
7 M urea and 10 mM 2-mercaptoethanol . We then sub- 
jected the suspension to ultracentrifugation at 
85,000 g for 90 min at 4°C. The supernatant was 
diluted by the addition of 80 ml of 20 mM Tris 
20 (pH 7.5) buffer containing 7 M urea and 10 mM 
2-mercaptoethanol and 40 ml of the sample was 
applied to a 3 x 4 cm Q-Sepharose fast- flow column 
(Sigma, St. Louis, Missouri) which had been pre- 
eguilibrated in the same buffer. The column was 
25 developed with a gradient in 400 ml total volume of 

increasing NaCl from 0 to 0.3 M in the same Tris/urea/ 
2-mercaptoethanol buffer. Column fractions were 
monitored for absorbance at 280 nm and for protein 
content by SDS-PAGE (15% aery 1 amide). The fractions 
30 were also analyzed by Western blots. Figure 30, 

panel (a) is a chromatogram displaying the purifi- 
cation of rsT4. 113.1 by ion-exchange chromatography. 
In that figure, peaks containing rsT4. 113.1 are 
identified. The rsT4. 113.1 was found to elute early 
35 in the NaCl gradient and to be well -re solved from 
low-molecular weight contaminants . 



WO 89/01940 



PCT/US88/02940 



-63- 

In order to separate rsT4. 113.1 from high- 
molecular weight contaminants, we carried out gel- 
filtration chromatography on an rsT4 . 113 . 1-containing 
pool for final purification of the protein to near 
5 homogeneity { >95% purity ) - More specifically, we 
prepared a pool containing 20 mg of protein in 50 ml 
and then concentrated to 10 ml in a stirred-cell 
ultrafiltration unit (Araicon, Danvers, MA.) using a 
PM-30 membrane (Amicon). Subsequently, 5.0 ml of 

10 the concentrate was applied to a 1.5 x 95 cm S-300 
column (Sigma) equilibrated and developed in the 
same Tris/urea/2-mercaptoethanol buffer. We moni- 
tored the column fractions for absorbance at 280 nm 
and for protein content by SDS-FAGE. The fractions 

15 were also analyzed by Western blots. A pool con- 
taining rsT4. 113.1 (approximately 4 mg) in 15 ml was 
thus prepared. Figure 30, panel (b) is a chromato- 
gram displaying the purification of rsT4 . 113.1 by 
gel-filtration separation of the rsT4. 113.1 pool. 

20 in that figure, peaks containing rsT4. 113.1 are 
identified. 

Figure 30, panel (c) is an SDS-PAGE analysis 
depicting the purification of the rsT4 derivative 
throughout the centrifugation and chromatography 
25 steps. in Figure 30, panel (c), the lanes depicted 
are: 

lane A: molecular weight standards 
lane B: cell extracts 

lane Cr cell pellet following solubilization 
30 of cell extract in non-denaturing 

conditions 

lane D: supernatant following solubilization 
of cell extract in non-denaturing 
buffer 

35 lane E: supernatant following ultracentri- 

fugation step 
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lane F: Q-Sepharose pool 

lane G: S-300 gel- filtration pool. 

Refolding Of Purified rsT4. 113.1 

We refolded the purified rsT4. 113.1 by 
5 dilution and dialysis steps to non- denaturing and 
oxidized conditions. More specifically, refolding 
of the protein at a concentration of 0.5 OD (280)/ml 
was achieved by stepwise dialysis against 500 volumes 
of 3 M urea, 20 mM Tris (pH 7.5); 500 volumes of 1 M 

10 urea, 0.1 M ammonium acetate (pH 6.8) and, finally, 
the same volume of a phosphate-buffered saline solu- 
tion. Throughout the refolding procedure, samples 
of the protein were monitored for relative content 
by spectral analysis and by high-performance liquid 

15 chromatography ("HPLC" ) performed on a 150A liquid 
chromatographic system (Applied Biosystems, Inc., 
Foster City, California). An octasilyl column 
(Aquapore RP-300, 0.46 x 3.0 cm) was equilibrated in 
80% 0.1% trifluoroacetic acid ( "TFA" ) /water (sol- 

20 vent A) and 20% 0.085% TFA/70% acetonitrile (sol- 
vent B) and developed with a linear gradient of 
increasing acetonitrile concentration from 20% to 
80% (solvent B) over 45 min at a flow rate of 
0.5 ml/min. 

25 As shown in Figure 31, panel (a), protein 

in 7 M urea, 10 mM 2-mercaptoethanol and 20 mM 
Tris(pH 7.5) eluted from the HPLC column at 49% 
acetonitrile in the gradient. In subsequent steps, 
from 1 M urea/1 mM ammonium acetate (pH 6.8) [Fig- 

30 ure 31, panel (b)] to phosphate buffered saline 

[Figure 31, panel (c)], an increasing percentage of 
rsT4. 113.1 was found to elute earlier in the HPLC 
gradient — at 47% acetonitrile. The identity of 
the earlier eluting peak as oxidized product was 

35 verified by reduction of rsT4. 113.1 in no n-chao tropic 
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solutions and application of sample thus treated to 
HPLC under the same conditions . 

The elution of oxidized rsT4. 113.1 prior to 
reduced protein on HPLC suggests that formation of 
5 the single disulfide bridge decreases relative hydro- 
phobicity of the protein [J. L. Browing et al . , Anal. 
Biochem. , 155, pp. 123-28 (1986)]. Spectral analysis 
of rsT4. 113.1 was performed throughout the course of 
refolding in order to monitor relative yield of solu- 
10 ble protein in the procedure. The refolding method 
allowed approximately 20% recovery of rsT4. 113.1. 
HPLC analysis indicated a less than 15% contaminant 
of reduced protein in the preparation (Figure 30, 
panel <c), lane G). 

15 Sequencing Of Renatured rsT4.113 

We then carried out amino acid analysis of 
rsT4. 113.1 by automated Edman degradation in an 
Applied Biosystems 470A gas phase sequenator equipped 
with a 900 A data system. Phenylthiohydantion amino 

20 acids generated during the course of the degradative 
chemistry were analyzed on-line using an Applied 
Biosystems 120A PTH-analyzer equipped with a PTH-C18 
2.1 x 220 mm column. Protein (10 pg) for sequence 
analysis was applied to SDS-PAGE (15% acrylamide) 

25 and electroblotted on an Immobilon membrane (Millipore 
Corp., Bedford. Massachusetts) as described by 
P. Matsudaira, J. Biol. Chem. , 262, pp. 10035-38 
(1987). 

Amino acid analysis of protein samples was 
30 performed by hydrolysis of protein in 6 N HCl, in 
vacuo, for 24 h at 110°C. The hydrolysates were 
then applied to a Beckman 6300 Analyzer equipped 
with post-column detection by ninhydrin. Western 
blot analysis of the SDS-PAGE gels was carried out 
35 by standard techniques using rabbit antisera JB-1 . 
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Sequence analysis revealed an amino terminal sequence 
of: Met-Gln-Gly-Asn-Lys-Val-Val ... 

The purified rsT4. 113.1 protein was found 
to contain stoichiometric quantities of the amino- 
5 terminal methionine placed in the protein construct 
for expression in E.coli and an intact polypeptide 
chain consistent with a sequence derived from the 
plasmid construction. Recovery of phenyl thiohydan- 
toinyl -methionine at the first cycle of the degrada- 

10 tive chemistry was 60% consistent with routine initial 
yields obtained in the automated Edman. This obser- 
vation excludes the possibiity that a significant 
percentage of the rsT4. 113.1 lacked the initiation 
methionine, i.e., the NH 2 - methionine was not removed 

15 by expression of rsT4. 113.1 in E.coli , or that sequence 
analysis was impaired by the presence of glutamine 
at the first cycle of the degradative chemistry. 
Sequence analysis was performed for 40 cycles and no 
evidence of lysine carbamylation was observed. Amino 

20 acid analysis displayed a close correlation of actual 

and theoretical values for amino acids, thus indicating 
the marked absence of proteolytic degradation in the 
course of expression, or purification, or both. 

Immunoprecipitation Of CHO Cell 
25 Lines Producing Soluble T4 

We tested the conditioned media from 35 S-Cys 
metabolically labelled CHO cells trans fected with 
one of the T4 mutant constructs pBG377, pBG380, pBG381, 
the full length recombinant T4 construct pBG379, of 

30 this invention or vector only, to determine whether 
any produced a molecule recognized by the anti-T4 
monoclonal antibody 19 Thy. To carry out this test, 
we incubated about 10 7 CHO cells trans fected with 
either pBG380, pBG381, pBG377, pBG379 or pBG312, for 

35 5 hours at 37°C with 180 pCi/ml 35 S-labelled cysteine 
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[DuPont, New England Nuclear] in 4 ml RPMI cys" medium 
(Gibco). After labelling of the cells, 1 ml of fil- 
tered, conditioned media was made 0.5 mM with phenyl - 
methyl-sulphonyl fluoride and immunoprecipitated 
5 with 0KT4 and protein A Sepharose [P. H. Sayre and 
E. L. Reinherz, Eur. J. Immunol. , 15, pp. 291-95 
(1985)]. Subsequently/ we incubated media from the 
35 S-labelled cells with 0KT4 (ATCC #CRL 8002). We 
then immuno-precipitated with protein A Sepharose 

10 and subjected the immuno-precipitates to SDS-PAGE 

under reducing conditions on 10% polyacrylamide gels 
[U. K. Laemmli, Nature , 227, pp. 680-85 (1980)]. 
Autoradiography was carried out with X-Oraat X-ray 
film (Eastman Kodak). 

15 As shown in lanes 3-5 of Figure 32, both 

pBG380 (rsT4.2) and pBG381 (rsT4.3) directed the 
synthesis of a secreted, immune, 35 S-labelled T4 
protein that was recognized by the OKT4 anti-T4 
antibody. The immunoprecipitated truncated raole- 

20 cules migrated as 49 Kd proteins, a result consis- 
tent with their predicted molecular weights. In 
contrast, no soluble T4 antigen could be detected in 
the conditioned media of cell lines stably trans- 
fected with pBG377 (rsT4.1) or pBG379 (rflT4). 

25 immunoprecipitation analysis of cellular extracts of 
cell lines transfected with pBG377 suggests that the 
rsT4.1 gene may be misfolded, which could account 
for a block in its secretion [M. J. Gething et al.. 
Cell , 46, pp. 939-50 (1986)]. 

30 in Figure 32, the lanes represent the 

following: Lane 1 : immunoprecipitation from condi- 
tioned medium of CHO cells stably co- transfected 
with vectors pBG312 and pAdD26. Lane 2 : blank. 
Lanes 3 and 4 : immunoprecipitation from conditioned 

35 medium of CHO cells stably co- transfected with pBG380 
(rsT4.2) and pAdr>26. Lanes 5 and 6 : immunoprecipita- 
tion from conditioned medium of CHO cells stably 
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co- trans fect-ed with pBG381 (rsT4.3) and pAdD26. 
Lane 7 : immunoprecipitation from conditioned medium 
of CHO cells stably co-transfected with recombinant 
full length T4 (pBG379) and pAdD26. In Figure 32, 
5 the arrow indicates the predicted position of the 
soluble T4 from pBG380 or pBG381 relative to the 
migration of standard molecular weight markers. 

Immunoprecipitation Of COS 7 Cell Lines 
Producing Recombinant Soluble T4 

10 We expressed recombinant soluble T4 

derivatives pBG392, pBG393 and pBG394 in COS 7 cells 
by electroporation, essentially as described by 
G. Chu et al. f "Electroporation For The Efficient 
Transfection Of Mammalian Cells With DNA" , Nuc. 

15 Acids Res. , 15, pp. 1311-26 (1987). More specifi- 
cally, we introduced 20 \ig closed circular plasmid 
DNA and 380 ug of carrier (sonicated salmon sperm 
DNA) into 3 x 10 7 COS 7 cells. The cells were 
electroporated using a Gene Fulser (Biorad) set at 

20 300 volts. Subsequently, we incubated the COS 7 

cells in Dulbecco's Modified Eagle's Medium supple- 
mented with 10% fetal calf serum for 24 hours. We 
then harvested the conditioned media, filtered it 
through a Millipore Minidisk 0.22 |J hydrophilic 

25 filter cartridge (Millipore #MCGL 305-01) and 

concentrated the secreted proteins on a fast-S ion 
exchange column (S-Sepharose Fast Flow, Pharmacia 
#17-0511-01) in 20 mM MES buffer (pH 5.5). 

We then eluted the bound proteins with 

30 20 mM Tris-HCl (pH 7.7) and 0.3 M NaCl. The elution 
pool was subsequently diluted with 2 volumes of 20 mM 
Tris-HCl (pH 7.7) and it was then loaded on a column 
comprising either 19Thy anti-T4 monoclonal antibody 
and protein A Sepharose or 0KT4A and protein A 

35 Sepharose. We washed the column extensively and 

eluted the bound material as 0.5 ml fractions with 
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50 mM glycine-HCl (pH 2.5), 150 mM NaCl, 0.1 mM EGTA 
and 5 pg/ml Bovine pancreatic trypsin inhibitor, 
Apxotinin (Sigma, #A1153). The immunoprecipitates 
were subjected to SDS PAGE (10% gel) followed by 
5 immunoblotting against rabbit antisera raised 

against peptide JB-1. We employed silver stained 
gels to follow binding and elution of rsT4 during 

chromatography . 

Figure 33 depicts an iramunoblot analysis of 

10 transiently expressed pBG392 (rsT4.7) [lanes 10, 
11]; pBG393 (rsT4.8) [lanes 4, 7, 8] and pBG394 
(rsT4-9) [lane 5]. The standards are 50 ng purified 
rsT4.3 (lane 1); 150 ng purified rsT4.3 (lane 2) and 
250 ng purified rsT4.3 (lane 3). The arrow indicates 

15 the expected position of migration of a protein with 
the relative molecular weight of rsT4.7: 21,000 
daltons. The sample that was to be loaded into lane 4 
was lost and lanes 6 and 9 are blank. 

As shown in lanes 10 and 11 of Figure 35, 

20 pBG392 (rsT4.7) directed the synthesis of a secreted, 
immune protein that was recognized by the anti-T4 
antibodies 0KT4A and 19Thy. Lanes 4, 7 and 8 also 
demonstrate that pBG393 (rsT4.8) directed the 
synthesis of a secreted, immune protein that was 

25 recognized by 0KT4A and 19Thy. This analysis 

illustrates that rsT4.7 contains the OKT4A epitope. 
It also suggests that the binding region for HIV 
envelope binding resides in the amino 182 terminal 
residues of T4. 

30 in contrast, no soluble T4 could be detected 

in the media of cell lines transfected with pBG394 
(rsT4.9) [see lane 5]. Immunoprecipitation analysis 
of cellular extracts of cell lines transfected with 
pBG397, however, showed that rsT4.9 was recognized 

35 by 0KT4A. We believe that rsT4.9, a 113 amino acid 

construct, binds the HIV virus and that it represents 
a second generation soluble T4, one with only two 
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cysteines and one of three disulfide bridges. 
Accordingly, rsT4.9 is easily produced in E.coli or 
yeast systems. 

Similarly, although no soluble T4 could be 
5 detected in the media of cell lines trans fected with 
pBG396 (rsT4.12), analysis of cellular extracts of 
those cell lines showed that rsT4.12 was recognized 
by 0KT4A. Thus, rsT4.12 may also bind HIV virus. 

Radioimmunoassay And Epitope 
10 Analysis Of rsT4.113 

In order to determine if the 113 fragment 
of rsT4 contained structural determinants for binding 
to 0KT4A, Leu- 3 A and OKT4, we then carried out 
radioimmunoassay and epitope analysis of rsT4.113 

15 using a competitive inhibition radioimmunoassay 

[C. J. Newby et al., "Solid-Phase Radioimmune Assays" 
in Handbook Of Experimental Immunology , D. M. Weir 
(Ed.), 1, pp. 34.1-34.8 (1986)]. As 0KT4A and Leu-3A 
block infectivity of HIV in vitro f Dalgleish et al. , 

20 supra] and binding of T4 to gpl20/160 [ McDougal et al . , 
supra] , this analysis served as a first approximation 
as to whether or not rsT4.113 contained structural 
elements for interaction with HIV. 

We first coated U-bottom 96 well microti ter 

25 plates (Falcon) with 50 pl/well goat-anti-mouse IgG 
(Hyclone Typing Kit, Logan, Utah) in PBS (pH 7.0) to 
a concentration of 50 ug/ml and incubated the plates 
overnight at 4°C. We then rinsed the plates with 
IX PBS and blotted them dry. The plates were then 

30 blocked by the addition of 100 pl/well of a IX PBS 
solution containing 5% bovine serum albumin for 
1 hour at room temperature. We rinsed the plates 
with PBS, blotted dry and then spotted them with 
50 pi of one of three antibody solutions containing 

35 either OKT4 (10 pg/ml in block buffer); 0KT4A 
(500 ng/ml in block buffer) or Leu-3A (Becton- 
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Dickinson) (500 ng/ml in block buffer). We let the 
plates stand for 2 hours at room temperature. We 
then washed the plates 3 times with a PBS/0.05% 
Tween-80 solution and 2 times with IX PBS and blotted 
5 them dry. 

in a separate plate, we titrated competitor 
samples of unlabeled rsT4. 113.1 from 20 vg/ml and 
serially diluted twice (including no competitor con- 
trol), with final volumes in each well of 25 

10 The positive control for this assay was competition 
with unlabeled rsT4.3 (375 amino acids). We then 
added 25 pi of 125 I-rsT4.3 containing 10,000 
cpm/25 Ml (prepared according to A. E. Bolton and 
W. M. Hunter, Radioimmunoassay And Related M ethods, 

15 Chapter 2c). Subsequently, we spotted the entire 

50 pi content of each well onto the assay plate con- 
taining each of the three antibody solutions and 
incubated for 2 h at room temperature. We then 
washed the plates 3 times with a PBS/0.5% Tween-80 

20 solution and 2 times with IX PBS, blotted them dry 

and then counted the wells in a Beckman gamma counter 
for radioactivity. 

As shown in Figure 34, rsT4. 113.1 competes 
with 125 I-rsT4.3 for absorption to an 0KT4A solid 

25 phase in a dose- dependent manner. Additionally, 

rsT4. 113.1 competes with 125 I-rsT4.3 for absorption 
to a Leu-3A solid phase in a dose-dependent manner. 
By comparison to unlabeled rsT4.3, rsT4. 113.1 exhibits 
a molar affinity for those antibodies within a factor 

30 of 3. In the 0.4 to 25 vq/ml concentration range 

tested, rsT4.113 did not compete with radiolabeled 

rsT4.3 for binding to OKT4. In a similar assay, we 

12 5 

observed that rsT4.111 also competes with I-rsT4.3 
for binding to 0KT4A and Leu-3A, but not to 0KT4 
35 [Figures 35-37] . 

Based on these results, we believe that 
the epitopes for 0KT4A and Leu-3A are contained within 
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the amino-terminal 113 amino acids of T4. We also 
believe that the epitope for 0KT4 binding is localized 
within the carboxy terminal of the T4 polypeptide. 

Accordingly, we believe that the gpl20- 
5 binding domain is localized within the amino terminal 
113 or 111 amino acids of the T4 protein- Based on 
this belief, we synthesized various synthetic oligo- 
peptides which contain sequence within that structural 
domain. These oligopeptides are represented in 
10 Figure 3 as follows: 

Oligopeptide Amino Acid Coordinates 

JB-1 44-63 
rsT4 #6 18-29 
rsT4 #7 5-56 
15 rsT4 #8 84-97 

rsT4 #9 30-63 
We synthesized these peptides using conventional 
phosphoamide DNA synthesis techniques [ Tetrahedron 
Letters , 22, pp. 1859-62 (1981)]. We synthesized 
20 the peptides on an Applied Bio systems 380A DNA 

Synthesizer and- purified them by gel electrophoresis. 

ELISA Assay For rsT4.113 

We also carried out an ELISA assay for 
rsT4. 113.1 produced by p211-ll-trans formed E.coli . 

25 Throughout this assay, dilutions were made in block- 
ing solution and, between each step, we washed the 
plates with PBS/0.05% Tween-20. More specifically, 
we coated wells of Immulon 2 (Dynatech, Chantilly, 
Virginia) plates with .005 OD (280 nm)/ml of 0KT4 

30 (IgG2b) in 0.05 M bicarbonate buffer to a volume of 
50 ul/well and incubated the plates overnight at 
4°C. We then blocked the plates with 5% bovine 
serum albumin in PBS, 200 ul/well, and incubated for 
30 minutes at room temperature. 

35 Subsequently, we added 50 pi of 50 ng/ml 

rsT4.3 to each well, incubating overnight at 4°C. 
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We then added 50 pl/well of a mixture containing 
rsT4. 113.1 and 10 n g/ml of 0KT4A and incubated for 
2 1/2 hours at room temperature. Using a Hyclone 
Kit (Hyclone), we then carried out the following 
5 steps. First, we added 1 drop of rabbit anti-mouse 
IgG2a to each well and incubated the plates for 
1 hour at room temperature. We then added 100 pi of 
peroxidase-labeled anti-rabbit IgG, diluted 1:4000 
with blocking buffer to each well, and incubated for 

10 1 hour at room temperature. 

We prepared a substrate reagent as follows. 
We diluted substrate reagent 1:10 in distilled water 
and added two O-phenyl-ethylene-diamine ("OPD" ) 
chromophore tablets per 10 ml of substrate. We let 

15 the mixture dissolve thoroughly by mixing with a 

vortex. Alternatively, a TMB peroxidase substrate 
system (Kirkegaard & Perry Catalogue #50-76-00) may 
be used. Subsequently, we added 100 pi of the 
chromophore solution to each well, incubated for 

20 10-15 minutes at room temperature and then stopped 
the color development with 100 pi of 1H H 2 S0 4 . We 
then measured OD at 490 nm, using an EL ISA plate 
reader. 

The results of the assay are demonstrated 

25 in Figure 38. 

We then subjected the soluble T4 proteins 
produced by the T4 constructs of this invention to 
various functional assays. 

Assays Of The Antiviral Acti vity Of Soluble T4 
30 The antiviral activity of soluble T4 

according to this invention was evaluated using 
modifications of various in vitro systems used to 
study antiviral agents and neutralizing antibodies 
[D. D. Ho et al., "Recombinant Human Interferon Alpha 
35 (A) Suppresses HTLV-III Replication In Vitro", 
Lancet , pp. 602-04 (1985); K. Hartshorn et al . , 
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"Synergistic Inhibition Of HTLV-III Replication 
In Vitro By Phosphonoformate And Recombinant Inter- 
feron Alpha-A", Antimicrob Ag Chemoth , 30, pp. 189-91 
(1986)]. 

5 For each of these assays, we prepared graded 

concentrations of soluble T4 and preincubated them 
with an H9 derived II IB isolate of HIV [a gift from 
Drs. M. Popovic and R. Gallo, National Cancer 
Institute, Bethesda, Maryland]. The isolate was 
10 maintained as a chronically infected culture in H9 
cells. Cell-free HIV stocks were obtained from 
supernatant fluids of HTLV-III infected H9 cultures 
(culture conditions: 1 x 10 6 cells/ml with 75% viable 
cells ) . We prepared serial 10 fold dilutions of 
15 recombinant soluble T4 ranging from 10 picograms/ml 
to 10 micrograms/ml and incubated them with fifty 
50% tissue culture infectious doses (TCID 5Q ) of HIV 
for 1 hour at 37°C, in RPMI-1640 supplemented with 
20% heat inactivated fetal calf serum (FCS). We 
20 then added 150 ul of H9 cells to a final concentra- 
tion of 0.5 x 10 6 cells/ml which were not HIV-infected 
to the wells containing aliquots of the recombinant 
soluble T4/HIV mixture. 

We adjusted each virus inoculum to a con- 
25 centration of 250 TCID 50 /ml. We preincubated 100 pi 
of the virus inoculum with 200 ul recombinant solu- 
ble T4 or 100 pi immunoglobulin prepared in tripli- 
cate serial 2 -fold dilutions for 1 hour at 37 °C 
prior to inoculation onto 1.5 - 2 x 10 6 H9 cells in 
30 5 ml RPMI 1640 supplemented fetal calf serum (20%), 
HEPES (lOmM), penicillin {250 U/ml), streptomycin 
(250 ug/ml) and L-glutaraine (2mM) . On days 5, 6, 1, 
10 and 14, we examined each culture for characteris- 
ic cytopathic effects ("CPE"). Neutralization was 
35 defined as the inhibition of syncytia formation com- 
ared with controls. 
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The positive control used was HIV seroposi- 
tive neutralizing serum, as described in D. D. Ho 
et al., "Human immunodeficiency Virus Neutralizing 
Antibodies Recognize Several Conserved Domains On 
5 The Envelope Glycoproteins", J. Virol., 61, 

pp. 2024-28 (1987). The negative controls used 
were HIV seronegative serum only and buffer only. 

Cytopathic Effect Assay (CPE) 

In this assay, following conventional 
10 protocols for cytopathic effect assays rKlatzmann 

et al. (1984), supra and Wonq-Staal and Gallo (1985), 
supra], we microscopically examined the H9 cells for 
evidence of cytopathic effects of HIV. 

The CPE was scored on a four point scale 
15 from 1+ to 4+, with 4+ representing the highest 
degree of CPE. 

By day 14, wells containing recombinant 
soluble T4 according to this invention (rsT4.2, 
derived from the pBG380 trans fected CHO cell line 
20 BG380) at 10 pg/ml showed no evidence of CPE, while 
the negative control showed 1+ to 3+ CPE. 

p24 Radioimmunoassay 

We then tested soluble T4 as an inhibitor 
of viral replication in an HIV virus replication 

25 assay according to D. D. Ho et al., J . Virol . , 61, 
pp. 2024-28 (1987) and J. Sodroski et al., Nature , 
322, pp. 470-74 (1986). We carried out the assay 
essentially as described, except that the cultures 
were propagated in microtiter wells containing 

30 200 pi- In assay, we evaluated the ability of 

the soluble T4 polypeptides of this invention to 
block HIV replication, as measured by HIV p24 
antigen production. We sampled supematants twice 
weekly for HIV p24 antigen as described below. 
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We obtained an assay kit [HTLV-III p24 
Radioimmunoassay System, Catalogue No. NEK-040, 
NEK-040A, Biotechnology Systems, New Research 
Products, Dupont] which contains affinity purified 
5 125 I labelled HIV p24 antigen, a rabbit anti-p24 

antibody and a second goat anti-rabbit antibody which 
is used to precipitate antigen- antibody complexes. 
We carried out the assay according to the protocol 
included with the kit. Accordingly, we mixed a sample 
10 to be assayed or one of a series of amounts of 

125 

unlabelled p24 antigen with a fixed amount of I 
labelled p24 and a fixed limited amount of rabbit 
anti-p24 antibody. We incubated the samples over- 
night at room temperature and then added a goat 
15 anti-rabbit immunoglobulin preparation for 5 minutes 

at 40 °C. We centrifuged the samples in a micro fuge 

125 

and aspirated the supernatant fluid. Pelletted I 

labelled p24 was quantitated for each sample by gamma 

125 

counting and a standard curve for the I p24 dis- 
20 placed by the known amounts of antigen added to 

standard tubes was constructed. We then calculated 
the 12 5 I labelled p24 displaced by the antigen present 
in the unknown samples by interpolation using the 
standard curve constructed from the known amounts of 
25 p24 antigen contained in the standard samples . The 
results are shown in the table below. 
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P24 ASSAY OF HIV REPLICATION INHIBITION 



rsT4 . 2 
Day fug/ml) 



0.5* 
5.0** 



10 



Patient 
Serum 


Average 
CPM 


% Bound/ 
Unbound 


Negative 
Positive 


344 

Z , £3 / 

551 
1,766 


8.5 
112 4 
19.9 
86.6 


Negative 
Positive 


230 
2,459 
322 
1,980 


2.2 
124.6 
7.3 
96-3 


Negative 
Positive 


221 
2,284 

246 
1,988 


1.8 
115.0 
3.1 
98.7 



10 0.5* 
5.0** 

14 

0.5* 

15 5.0** 

These results demonstrate that soluble T4 
according to this invention at a concentration of 
5 ug/ml completely inhibits virus replication as 
measured in this standard 14 day assay. These 

20 results are also depicted in Figure 39 in graphic 
form. In Figure 39, values were calculated from a 
standard curve of p24 according to assay Jcit 
instructions . 



25 * This concentration was initially believed to be 
1.0 vg/ml, based upon our preliminary approximation 
that 1 unit of absorbance at 280 nm ("A 280 " ), was 
equivalent to 1 mg of rsT4.2. Absorbance at 280 nm 
is a commonly used first approximation of protein 

30 concentration. Upon amino acid analysis of the pro- 
tein, however, we found that it had a higher extinc- 
tion coefficient than originally approximated, with 
1 A unit of rsT4.2 being equivalent to 0.5 mg of 
the 2 Brotein. 

35 ** This concentration was initially believed to 

be 10 ug/ml, based upon our preliminary approximation 
that 1 unit of absorbance at 280 nm < f, A 280 ")/ was 
equivalent to 1 mg of rsT4.2. Absorbance at 280 nm 
is a commonly used first approximation of protein 

40 concentration. Upon amino acid analysis of the pro- 
tein, however, we found that it had a higher extinc- 
tion coefficient than originally approximated, with 
1 A unit of rsT4.2 being equivalent to 0.5 mg of 
the 2 §rotein. 
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We then carried out a p24 replication assay 
as described above, except that the soluble T4 was 
added to the infected cultures during refeeding at 
days 3, 1 and 10, in order to maintain a constant 
5 rsT4 concentration throughout the infection period. 
The results of this assay are shown in the table 
below. 

INHIBITION OF HIV REPLICATION 
WITH CONSTANT CONCENTRATION OF rsT4 



10 rsT4.2 p24 

(nq/ml) (ng/ml) 

0.008 770 

0.031 970 

0.125 85 

15 0.5 0 

5.0 0 

0 1120 

uninfected 0 



These results demonstrate that when solu- 
20 ble T4 protein according to this invention was main- 
tained at a constant concentration throughout the 
infection period, as little as 0.125 of the 

protein substantially blocked replication of 250 
TCID 50 /ml of HIV-1. 
25 Advantageously, soluble T4 protein accord- 

ing to this invention, at concentrations far exceed- 
ing those required to block viral replication, did 
not exert immuno toxic effects in vitro , as measured 
by three lymphocyte proliferation assays — mixed 
30 lymphocyte response, phytohemagglutinin, and tetanus 
toxoid stimulated response. 

Syncytia Inhibition Assay 

To further assess the effect of soluble T4 
on HIV env-T4 binding, we evaluated the effect of two 
35 preparations of our soluble T4 protein on the syn- 
cytiagenic properties of HIV in the co-cultivation 
assay. We carried out a C8166 cell fusion assay 
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as described in B. D. Walker et al . , Proc. Natl. 
Acad. Sci. USA , 84, pp. 8120-24 (1984). 

We incubated 1 x 10 9 H9 cells chronically 
infected with HTLV-IIIB for 1 hour at 37°C in 5% 
5 C0 2 with various concentrations of one of two 

preparations of rsT4.2 in 150 M l RPMI-1640 media 
supplemented with 20% fetal calf serum. We then 
added 3 x 10 4 C8166 cells in 50 \il media (a T4 
transformed human umbilical cord blood lymphocyte 

10 line f sodroski et al. , supra], to a final volume of 
0.2 ml in each well. Final well concentrations of 
soluble T4 were 0.5 pg/ml* and 5.0 pg/ml* for prepa- 
ration #1 and 1.25 pg/ml* and 12.5 pg/ml* for prepa- 
ration #2. We then counted total number of syncytia 

15 per well at 2 hours and 4 hours after adding the 

C8166 cells at 37°C in 5% C0 2 - Parallel co-cultiva- 
tions used buffer alone (negative control) or 0KT4A 
at 25 pg/ml (positive control) as controls. We con- 
sidered a positive result as a 50% reduction in 
- 20 syncytia compared to controls, at a time when at 
least 100 syncytia per 10 4 infected H9 cells were 
present in the control cultivations. The results of 
this assay are shown below and in Figure 40 (2 hour 
data). 



* These concentrations were initially believed 
to be, respectively, 1 pg/ml, 10 pg/ml, 2.5 pg/ml 
and 25 pg/ml, based upon our preliminary approxima- 
tion that 1 unit of absorbance at 280 nm ? U 
30 was equivalent to 1 mg of rsT4.2. Upon aminG acid 
analysis of the protein, however, we found that it 
had a higher extinction coefficient than originally 
approximated, with 1 A 2SQ unit of rsT4.2 being 
equivalent to 0.5 mg or the protein. 



WO 89/01940 



PCT/US88/02940 



-80- 

INHIBITION IN C8166 FUSION ASSAY 

% Inhibition * 



Preparation 


TrsT4.21 <|jg/ml). 


2 Hrs 


4 Hrs 


buffer 


0 


0 


0 


rsT4.2 


0.5** 


30 


42 


rsT4.2 


5.0** 


54 


47 


rsT4.2 


1.25** 


16 


21 


rsT4.2 


12.5** 


77 


55 


0KT4A (25 pg/ml) 


0 


100 


100 



10 As demonstrated in this table and in Fig- 

ure 40, soluble T4 according to this invention at 
5.0 pg/ml and 12.5 \ig/ml inhibited syncytia formation 
at 2 hours, as compared to buffer alone. By 4 hours 
after the addition of C8166 cells, soluble T4 at 

15 12.5 vg/ml continued to inhibit greater than 50% 
syncytia formation, as compared to the negative 
control . 

We also evaluated the effect of two prep- 
arations of our soluble T4 protein rsT4.7 on the 
20 syncytiagenic properties of HIV in a similar co- 
cultivation assay. The results of this assay are 
shown below. 



* All assays were carried out in triplicate, and 
25 the number of syncytia counted per well was averaged 
to calculate % inhibition. The % inhibition repre- 
sents the difference between the average number of 
syncytia in the negative control (without rsT4 or 
OKT4A) and the average number of syncytia counted 
30 when either rsT4 or 0KT4A were present during the 
assay, divided by the average syncytia count for 
the negative control and multiplied by 100. 

** These concentrations were initially believed 
to be, respectively, 1 pg/ml, 10 Mg/nil, 2,5 fjg/ml 

35 and 25 pg/ml, based upon our preliminary approxima- 
tion that 1 unit of absorbance at 280 nm ("A-qq")/ 
was equivalent to 1 mg of rsT4.2. Upon amino acid 
analysis of the protein however, we found that it 
had a higher extinction coefficient than originally 

40 approximated, with 1 °f rsT4.2 being 

equivalent to 0.5 mg or the protein. 
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INHIBITION IN C8166 FUSION ASSAY 



5 Preparation 

H9 cells 
(control ) 

C8166 cells 
(control) 

10 HIV-infected 
H9 cells 
added to 
C8166 cells 
(control ) 

15 0KT4A 

(control ) 

Prep. 1 of 
rsT4 . 7 



rsT4.7 



Average 
Syncytia/50pl 
aliquot 



118 



% Inhibition 
at 2 Hrs 

N/A 
N/A 



= 5.0* 



43 



100 



63.6 



20 * This concentration */as initially believed to 
be 10 pg/ml, based upon our preliminary approxima- 
tion that 1 unit of absorbance at 280 nm ("A^gg")' 
was equivalent to 1 mg of rsT4.2. Upon amino" acid 
analysis of the protein, however, we found that it 

25 had a higher extinction coefficient than originally 
approximated, with 1 A^q unit of rsT4.2 being 
equivalent to 0.5 mg or^the protein. 
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Average 

rsT4.7 Syncytia/SOpl % Inhibition 
Preparation (ug/ml ) aliquot at 2 Hrs 

5 H9 cells 0 0 N/A 

(control) 

C8166 cells 0 1 N/A 

(control) 

HIV-infected 0 141 0 

10 H9 cells added 
to C8166 cells 
( control ) 

0KT4A (control) 0 0 100 

Prep. 2 of 

15 rsT4.7 = 5.0* 27 80.9 



* This concentration was initially believed to 
be 10 Mg/ml, based upon our preliminary approxima- 
tion that 1 unit of absorbance at 280 nm (" A 280 U ?' 
20 was equivalent to 1 mg of rsT4.2. Upon amino acid 
analysis of the protein, however, we found that it 
had a higher extinction coefficient than originally 
approximated, with 1 A 28Q unit of rsT4.2 being 
equivalent to 0-5 mg or the protein. 
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fl fig *v date? dav 14 



Preparation 

5 H9 cells 
(control ) 

C8166 cells 
(control ) 

HIV-infected 
10 H9 cells added 
C8166 cells 
( control ) 

0KT4A (control) 

Prep. 1 of 
15 rsT4.7 

Prep. 2 of 
rsT4 . 7 



Average 
rsT4.7 Syncytia/50Ml 
(uq/ml ) aliquot 



= 5.0* 



= 5.0* 



128 



35 



% Inhibition 
at 2 Hrs 

N/A 



N/A 



100 

72.7 

98.4 



20 



25 



30 



35 



As demonstrated in these tables, soluble 
T4 protein rsT4.7 inhibited syncytia formation in 
HIV-infected H9 cells. 

We also evaluated the effect of rsT4. 113.1 
and rsT4.111 on the syncytiagenic properties of HIV in 
a co-cultivation assay. We carried out a C8166 cell 
fusion assay as described in Walker et al . , supra. 

We incubated 1 x 10 4 H9 cells chronically 
infected with HTLV-IIIB for 1 hour at 37°C in 5% 
C0 2 , with from 5 to 50 pg/ml rsT4. 113.1 or rsT4.111 
in 150 [il KPMI-1640 media supplemented with 20% 
fetal calf serum in 96-well microti ter plates. We 



* This concentration was initially believed to 
be 10 yg/ml, based upon our preliminary approxima- 
tion that 1 unit of absorbance at 280 nm (" A 28o" ?' 
was equivalent to 1 mg of rsT4.2. Upon amino acid 
analysis of the protein, however, we found that it 
had a higher extinction coefficient than originally 
approximated, with 1 A unit of rsT4.2 being 
equivalent to 0.5 mg or the protein. 
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then added 3 x 10 4 C8166 cells to the wells in 50 pi 
aliquots. The plates were incubated for 2 hours at 
37° C in 5% C0 2 and, following this incubation, the 
number of syncytia per well were counted. 
5 Syncytia were defined as cells containing 

a ballooning cytoplasm greater than three cell 
diameters. All samples were counted twice. Parallel 
co-cultivation used 0KT4A alone or rsT4-3 alone at a 
concentration of 25 pg/ml (positive controls) or H9 
10 cells alone or C8166 cells alone (negative controls). 
The results of this assay are shown below and in 
Figure 41. 



INHIBITION IN 


C8166 FUSION 


ASSAY 




Preparation 


rsT4 ( m g/ml ) 


% 


Inhibition 


H9 cells (control) 


0 




0 


C8166 cells (control) 


0 




0 


rsT4. 113.1 


1.25 




35 


rsT4. 113.1 


2.5 




63 


rsT4 . 113 . 1 


4.25 




63 


rsT4. 113-1 


6.25 




82 


rsT4 . 113 . 1 


12.5 




96 


rsT4.3 


12.5 




100 


0KT4A (25 \iq/ml) 


0 




100 



As demonstrated in this table and in 
25 Figure 41, rsT4. 113.1 exhibited a dose- dependent 

inhibition of HIV- induced syncytia formation. The 
molar specific inhibitory activity of rsT4. 113.1 
appeared to be reduced by an order of magnitude by 
comparison to anti-viral activity of longer forms of 
30 recombinant soluble T4. Thus, whereas rsT4. 113.1 is 
effective toward neutralization of HIV-dependent 
cell fusion in vitro, its molar specific inhibitory 
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activity is decreased by a factor of 10. It is 
undetermined whether this decreased potency is due 
to incomplete renaturation of the E . coli -derived 
protein, the presence of three additional amino 
5 acids at the N-terminus of rsT4. 113.1 (Met-Gln-Gly ) 
lacking in rsT4.2 or rsT4.3 produced in mammalian 
cells, or the absence of additional structure in 
rsT4. 113.1 required for high-affinity binding to 
HIV. 

10 we also carried out a C8166 cell fusion 

assay with rsT4.111, as described for rsT4. 113.1. 
The results of this assay are shown below. 

INHIBITION IN C8166 FUSION ASSAY 
Preparation rsT4(Mg/ml) % Inhibition 

15 H9 cell (control) 0 0 

C8166 cells (control) 0 0 

rsT4.111 1*25 0 

rsT4.Hl , 2.5 40 

rsT4.111 4.25 20 

20 rsT4.1H 6.25 67 

rS T4.111 12.5 100 

rsT4.111 25.0 100 

rsT4.3 12.5 100 

rsT4.3 25.0 100 

25 OKT4A (25 \iq/ml) 



0 100 



As demonstrated in this table, rsT4.111 
exhibited a dose- dependent inhibition of HIV-induced 
syncytia formation. At a concentration of 12.5 Mg/ml 
and 25.0 vg/ml, complete inhibition of cell fusion 
30 was achieved. 

Kinetics Of Intramuscular Injection Of Soluble T4 

We examined the kinetics of the appearance 
of a recombinant soluble T4 protein according to 
this invention (specifically, rsT4.3 from the pBG381- 
35 transfected cell line BG381) in serum after intra- 
muscular injection as follows. 
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We obtained two cynomolgus monkeys ( Macaca 
fascicularis ) who were free of infectious disease 
and in good health. Each monkey had been subjected 
to a 6 week quarantine period prior to administration 
5 of the soluble T4 protein. Throughout the adminis- 
tration period, each monkey was maintained on a con- 
ventional diet of monkey chow supplemented with fresh 
fruit. A catheter and a vascular access port were 
surgically placed in a femoral vein of each animal 

10 prior to treatment in order to facilitate blood 
collection. 

Over a period of 28 days, each animal 
received recombinant soluble T4 protein twice daily 
by intramuscular injection to the large muscles of 

15 the thighs or buttocks. Injections were administered 
to each animal 8 hours apart and each injection con- 
tained a volume of 0.15 ml/kg (0.25 mg/kg) of rsT4.3 
(from the pBG3 81- trans formed cell line BG381), for a 
total dose of 0.5 mg/kg/day/monkey . Serum samples 

20 for clearance determination were collected on day 1 
before the first treatment and at 1, 2, 4 and 
8 hours after the first injection, as well as 1, 2, 
4, 14 and 16 hours after the second injection on 
days 7, 14 and 28. 

25 We found that intramuscularly injected 

soluble T4 reached the maximum level in serum between 

1 and 2 hours after injection, with the level falling 
off slowly and reaching half -maximum value at approxi- 
mately 6 hours post- injection. According to data 

30 obtained for intravenous administration (not shown), 
the level of rsT4.3 in serum should drop below that 
attained via intramuscular injection aproximately 

2 hours after intravenous injection. Thus, while 
the maximum rsT4.3 level in serum after intramuscular 

35 injection does not reach that attainable via intra- 
venous injection, it is slowly released into the 
blood stream, remaining detectable in serum for a 
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much longer time. This slow release mechanism asso- 
ciated with intramuscular routes of injection is 
advantageous because a higher level of soluble T4 
protein is available over a longer period of time 
5 over a given concentration; thus remaining in a sus- 
tained level. Intramuscular administration of solu- 
ble T4 protein is particularly useful in treating 
early stage HIV-infected patients, to prevent the 
virus from disseminating, or in treating patients 
10 who have been exposed to the virus and who are not 
yet seropositive. 

We determined serum levels of rsT4.3 using 
an ELISA assay. Throughout this assay, dilutions 
were made in blocking solution and, between each 
15 step, we washed the plates with PBS/0.05% Tween-20. 
More specifically, we coated wells of Immulon 2 
plates with .01 OD (280 nm)/ml of 0KT4 (IgG2b) in 
0.05 M bicarbonate buffer to a volume of 50 fjl/well 
and incubated the plates overnight at 4°C, We then 
20 blocked the plates with 5% bovine serum albumin in 
PBS, 200 pi/well, and incubated for 30 minutes at 
room temperature. 

Subsequently, we added 50 pi of sample or 
standard to each well, incubating for 4 hours at 
25 room temperature. We then added 50 M l/well of OKT4A 
at 0.1 Mg/ml and incubated overnight at 4°C. Using 
a Hyclone Kit (Hyclone) we then carried out the fol- 
lowing steps. First, we added 1 drop of rabbit 
anti -mouse IgG2a to each well and incubated the 
30 plates for 1 hour at room temperature. We then added 
100 pi of peroxidase-labeled anti-rabbit IgG, diluted 
1:4000 with 5% BSA/PBS to each well, and incubated 
for 1 hour at room temperature. 

We prepared a substrate reagent as follows. 
35 We diluted substrate reagent 1:10 in distilled water 
and added two O-phenyl-ethylene-di amine ("OPD") 
chromophore tablets per 10 ml of substrate. We let 
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the mixture dissolve thoroughly by mixing with a 
vortex. Alternatively, a TMB peroxidase substrate 
system (Kirkegaard & Perry Catalogue #50-76-00) may 
be used- Subsequently, we added 100 pi of the 
5 chromophore solution to each well, incubated for 
10-15 minutes at room temperature and then stopped 
the color development with 100 pi of IN H 2 S0 4 - We 
then measured OD at 490 nm, using an ELISA plate 
reader . 

10 The results of the assay are demonstrated 

in the tables below. 
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10 



15 



Monkey #7-91 



Time(hr ) 


Day 1 


0 


22.7 


1 


278 .8 


2 


281.8 


4 


214.9 


5 




3 


72.3 


9** 


246.2 


10 


259.6 


12 


136.0 


22 


23.8 


24 


13.4 
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rsT4 Level 
( na/ml ) 

Day 7 Day 14 Day 28_ 

96.5 158.0 19.8 

199.6 360.7 238.3 

366.8 306.4 441.1 

246.6 363.9 393.2 

290.4 

105.0 199.4 



Monkey #7-92 



Time(hr) Day 1 

20 0 6.7* 

1 87.2 

2 254.2 
4 170.0 
5 

25 8 118.9 

9** 405.1 

10 523.5 

12 371.5 
22 48.4 

30 24 39.4 



rsT4 Level 
{ nq/ml ) 



Day 7 


Day 14 


Day 28 


56.0 


106.3 


60.9 


225.8 


178.0 


437.7 


377.9 


253.2 


770.6 


167.3 


308.2 


821.5 




898.3 


101.2 


176.5 





* - background 

** - second injection administered after the 
collection of the 8 hour sample. 



Polyvalent Forms Of Recombinan t Soluble T4 
3 5 Receptors may be characterized by their 

affinity for specific ligands, such that, at equili- 
brium, the intrinsic affinity (K ft ) between monovalent 
receptor and monovalent ligand can be defined as 
[ RL ]/[R f ][L f ], ^ere [RL] is the concentration of 
40 receptor (R) bound to ligand (L) and [R f ] and [L f ] 
are the concentrations of free receptor and ligand, 
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respectively [P. A. Underwood, in Advances In Virus 
Research , ed. K. Maramorosch et al., 34, pp. 283-309 
(1988)] . 

For a polyvalent receptor (with a valency 
5 of n) binding to a polyvalent ligand (with a valency 
of m), a functional affinity can be defined as 
n[R b ]/n[R f ]m[L f ], where [E^] is the concentration of 
bound receptor sites, and n[R f ] and m[L f ] are, respec- 
tively, the concentrations of free receptor and 
10 ligand binding sites. The effect of increasing the 
valence (the number of binding sites) is to enhance 
the stability of ligand-receptor complexes. The 
affinity of a polyvalent receptor for a polyvalent 
ligand will depend on three factors: the intrinsic 
15 association constant of each binding site, the 

valency (number of binding sites) and the topico- 
logical relationship between the receptor and ligand 
binding sites. Under some circumstances, polyvalent 
binding interactions will lead to higher functional 
20 affinity. The decreased dissociation rate of poly- 
valent ligands with polyvalent receptors results in 
an increased functional affinity [C. L. Hornick and 
F. Karush, Immuno chemistry , 9, pp. 325-40 (1972); 
I. Ottemess and F. Karush, "Principles Of Antibody 
25 Reactions", in Antibody As A Tool , ed. J. J. 

Marchalonais and G.W. Warr, pp. 97-137 (1982)]. 

The simplest case for receptor polyvalency 
increasing functional affinity is represented by a 
bivalent soluble receptor, such as an antibody 
30 molecule, which has two identical ligand binding 

sites, each capable of independently binding antigen 
with equal affinity. If the antigen is displayed 
polyvalently, for example, chemically coupled to a 
solid support such that the spacing between antigenic 
35 sites can. be bridged by the antibody's two antigen 

binding arms, the functional affinity of the antibody 
for the antigen coupled to the solid support would be 
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greater than the intrinsic affinity of the antibody 
binding site for the monovalent antigen [D. Crothers 
and H. Metzger, Immunochemistry , 9, pp. 341-57 
(1972)]. Because virus particles represent poly- 
5 valent antigens, the greater functional affinity of 
antibodies for polyvalent antigens is an important 
factor for antibody-directed virus neutralization. 

The association of recombinant soluble T4 
and the HIV major envelope glycoprotein gp!20 is an 

ra example of monovalent receptor binding to monovalent 
ligand. The affinity of this interaction has been 
measured, and the association between T4 and gpl20 
has a dissociation constant K d = 4 x 10 M [L . Lasky 
et al., Cell , 50, pp. 975-88 (1987)]. 

15 Using the antibody analogy, we believe 

that polyvalent rsT4 will demonstrate a greater 
affinity for HIV-infected cells displaying gpl20 
than monovalent rsT4 and the topicological relation- 
ship between gpl20 on the virus particle or the 

20 infected cell surface, will determine the degree to 
which polyvalent rsT4 exhibits higher functional 
affinity than monovalent rsT4. One example of a 
polyvalent rsT4 is described below, with respect to 
the production of a recombinant bivalent rsT4 con- 

25 sisting of two tandem repeats of amino acids 3-178, 

followed by the C-terminal 199 amino acids of rsT4.3. 
According to this invention, a "polyvalent" receptor 
possesses two or more binding sites for a given 
ligand. Furthermore, the intrinsic affinity of each 

30 ligand binding site of a given polyvalent receptor 
need not be identical. 

As shown in Figure 42, to construct bivalent 
rsT4, we digested pBG391 with Nhe l, which cleaves 
after the valine at position 178 in rsT4, and removed 

35 the Nhe l 5 1 overhang with mung bean nuclease. Next, 
we cleaved with Bo_lII to remove the C-terminal half 
of the rsT4 coding sequence in pBG3 91. Finally, we 
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ligated a Dra l- Bgl H fragment containing the coding 
sequence for rsT4 amino acids 3 (lysine) through 377 
(isoleucine) to the cleaved pBG391 to create pBiv.l, 
a plasmid coding for a fusion protein with a tandem 
duplication of the N- terminal 176 amino acids of 
rsT4, followed by the C-terminal 199 amino acids of 
rsT4.3. The protein produced by this plasmid, 
therefore, contains two adjacent N-terminal gpl20- 
binding or 0KT4A-binding domains (defined by amino 
acid residues 3 through 111 of rsT4.Hl), followed 
by one 0KT4-binding C-terminal domain (Figure 43). 

pBiv.l was transfected by electroporation 
into COS 7 cells to test expression of the bivalent 
rsT4 protein. Three days later, we tested the con- 
ditioned medium of the transfected cells for the 
presence of the rsT4 bivalent protein by immuno- 
precipitation, followed by Western blot analysis of 
the precipitated protein. Both 0KT4A and 0KT4 were 
used for immuno-precipitation to determine that the 
OKT4 epitope and at least one of the 0KT4A epitopes 
had folded correctly. Both antibodies precipitated 
a protein of the predicted apparent molecular weight 
(60,000d) from the conditioned medium of the cells. 

Bivalent rsT4 may be purified by immuno- 
affinity purification from an 0KT4 column and the 
purified protein may then be used to perform quanti- 
tative competition assays with rsT4.3. We believe 
that the bivalent molecule would demonstrate equi- 
valent competition against rsT4.3 for 0KT4 binding, 
but significantly greater competition against mono- 
valent rsT4 for OKT4A binding. The ability of 
bivalent recombinant soluble T4 to block syncytium 
formation may also be demonstrated in the C8166 
fusion assay. We also believe that bivalent 
recombinant soluble T4 would block syncytium 
formation at significantly lower concentrations 
than monovalent rsT4; based upon the higher 
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functional affinity of bivalent recombinant 
soluble T4 for gpl20. 

According to alternate embodiments of this 
invention, other methods for producing polyvalent 
5 rsT4 may be employed. For example, polyvalent rsT4 
may be produced by chemically coupling rsT4 to any 
clinically acceptable carrier molecule, a polymer 
selected from the group consisting of Ficoll, poly- 
ethylene glycol or dextran, using conventional 
La coupling techniques. Alternatively, rsT4 may be 
chemically coupled to biotin, and the biotin-rsT4 
conjugate then allowed to bind to avidin, resulting 
in tetravalent avidin/biotin/rsT4 molecules. And 
rsT4 may be covalently coupled to dinitrophenol 
15 (DNP) or trinitrophenol (TNP) and the resulting 

conjugate precipitated with anti-DNP or anti-TNP- 
igm, to form decameric conjugates with a valency of 
10 for rsT4 binding sites. 

Alternatively, a recombinant chimeric 
20 antibody molecule with rsT4 sequences substituted 
for the variable domains of either or both of the 
immunoglobulin molecule heavy and light chains may 
be produced.' Because recombinant soluble T4 
possesses gpl20 binding activity, the construction 
25 of a chimeric antibody having two soluble T4 domains 
and having unmodified constant region domains could 
serve as a mediator of targeted killing of HIV- 
infected cells that express gpl20. 

For example, chimeric rsT4/IgG 1 may be 
30 produced from two chimeric genes « an rsT4/human 
kappa light chain chimera (rsT4/C kappa ) and an 
rsT4/human gamma 1 heavy chain chimera 

(r<zTA/r ). Both C, and C: m __ 1 regions 

( rsi4/t -gamma-l' kappa gamma- 1 

have been isolated from human recombinant DNA 
35 libraries, and each has been subcloned into animal 
cell selection vectors containing either the 
bacterial neo resistance or bacterial gpt markers 
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for selection in animal cell hosts against the 
antibiotic G418 or mycophenolic acid, respectively. 

To construct rsT4/C gajrana . 1 and rsT4/C kappa 
chimeric genes, an rsT4 gene segment, including at 
5 least the secretory signal sequence and the N-terminal 
110 amino acid residues of the mature rsT4 coding 
sequence and including a splice donor or portion 
thereof, is placed upstream of the gamma-1 and kappa 
constant domain exons . A suitable restriction 
10 enzyme may be used to cut within the intron down- 
stream of the desired rsT4 coding sequence, thus 
providing a donor splice site. Subsequently, a 
suitable restriction enzyme is used to cut within 
the introns upstream of the kappa and gamma-1 
15 coding regions. The rsT4 sequence is then joined to 
the kappa or gamma-1 constant region sequence, such 
that the rsT4 intron sequence is contiguous with the 
gamma-1 and kappa introns. In this way, an acceptor 
splice site is provided by the kappa or gamma-1 
20 constant region intron. Alternatively, rsT4 chimeric 
genes may be constructed without the use of introns, 
by fusing a suitable rsT4 cDNA gene segment directly 
to the gamma-1 or kappa coding regions. 

The rsT4/C gamma _ 1 and rsT4/C kappa vectors 
25 may then be co trans fee ted, for example, by electro- 
poration into lymphoid or non-lymphoid host cells. 
Following transcription and translation of the two 
chimeric genes, the gene products may assemble into 
chimeric antibody molecules . 
30 Expression of the chimeric gene products 

may be measured by an enzyme-linked immunoadsorbant 
assay (ELISA) that utilizes monoclonal anti-T4 anti- 
body 0KT4A, as described infra, or in gpl20 competi- 
tion assays and radioimmunoassays, as described infra. 
3 5 Activity of the rsT4/IgG 1 chimeras may be measured 
by incubating them with HIV-infected cells in the 
presence of human complement, followed by quantitating 
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subsequent complement-mediated lysis of these cells. 
Alternatively, activity may be measured in HIV repli- 
cation and HIV syncytium assays as described infra. 

In order to determine if bivalent rsT4 has 
5 a greater potency than monovalent rsT4, we mixed 
0KT4, at various concentrations, together with a 
constant concentration of rsT4, so that the molar 
ratio of 0KT4:rsT4 varied between 0.2 and 4. After 
preincubating the mixture overnight at 4°C, we added 
10 aliquots to the HIV syncytium assay described infra. 
0KT4 has no observable effect in this assay when used 
alone. In addition, the concentration of recombinant 
soluble T4 chosen did not cause inhibition in this 
assay. Accordingly, we looked for indications that 
15 the OKT4/rsT4 mixture was more potent than rsT4 alone. 
We observed that at ratios of 0KT4:rsT4 greater than 
0.2, partial to complete inhibition of syncytium 
formation occurred. We believe that under conditions 
where two rsT4 molecules are bound to 1 0KT4 molecule, 
20 the greatest inhibitory effect should be found. 

Thus, polyvalent, as well as monovalent 
forms of recombinant soluble T4 are useful in the 
compositions and methods of this invention. 

Microorganisms and recombinant DNA mole- 
25 cules prepared by the processes of this invention 

are exemplified by cultures deposited in the In Vitro 
international, Inc. culture collection, in Linthicum, 
Maryland, on September 2, 1987, and identified as: 
BG378: E.coli MCl061/pBG378 
30 199-7: E.coli MC1061/pl99-7 

170-2: E.coli JA221/pl70-2 
EC100: E.coli JM83/pEC100 
BG377: E.coli MC1061/pBG377 
BG380: E.coli MC1061/pBG380 
35 BG381: E.coli MC1061/pBG381 

These cultures were assigned accession 
numbers IVI 10143-10149, respectively. 
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In addition, microorganisms and recombinant 
DNA molecules according to this invention are exempli- 
fied by cultures deposited in the In Vitro Interna- 
tional, Inc. culture collection, in Linthicum, 
5 Maryland, on January 6, 1988, and identified as: * 

BG-391: E.coli MC1061/pBG391 

BG-392: E.coli MC1061/pBG392 ? 
BG-393: E.coli MC1061/pBG393 
BG-394: E.coli MC1061/pBG394 
10 BG-396: E.coli MC1061/pBG396 

203-5 : E.coli SG936/p203-5 . 
These cultures were assigned accession 
numbers IVI 10151-10156, respectively. 

Microorganisms and recombinant DNA mole- 
15 • cules according to this invention are also exempli- 
fied by cultures deposited in the In Vitro 
International, Inc. culture collection, in Linthicum, 
Maryland, on August 24, 1988 and identified as: 
211-11: E.coli A89/pBG211-ll 
20 214-10: E.coli A89/pBG214-10 

215-7 : E.coli A89/pBG215-7 

These cultures were assigned accession 
numbers IVI 10183-10185 respectively. 

While we have hereinbefore described a 
25 number of embodiments of this invention, it is 

apparent that our basic constructions can be altered 
to provide othe embodiments which utilize the pro- 
cesses and compositions of this invention. There- 
fore, it will be appreciated that the scope of this 
30 invention is to be defined by the claims appended * 
hereto rather than by the specific embodiments which 
have been presented hereinbefore by way of example. * 
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CLAIMS 

We claim: 

1. A DNA sequence selected from the group 
consisting of: 

5 (a) the DNA inserts of pl99-7, pBG377, 

PBG380, pBG381, p203-5, pBG391, pBG392, pBG393, pBG394, 
pBG395, pBG396, pBG397, p211-ll, p214-10 and p215-7; 

(b) DNA sequences which hybridize 

to one or more of the foregoing DNA inserts and which 
10 code on expression for a soluble T4-like polypeptide; 
and 

(c) DNA sequences which code on 
expression for a soluble T4-like polypeptide coded 
for on expression by any of the foregoing DNA inserts 

15 and sequences. 

2. The DNA sequence according to claim 1, 
wherein said DNA sequence (b) codes on expression 
for a soluble T4-like polypeptide which inhibits 
adhesion between T4 + lymphocytes and infective agents 

20 which target T4 + lymphocytes and which inhibits 

interaction between T4 + lymphocytes and antigen pre- 
senting cells and targets of T4 + lymphocyte mediated 
killing. 

3. A recombinant DNA molecule comprising 
25 a DNA sequence selected from the group consisting of 

the DNA sequences of claim 1 or 2, said DNA sequence 
being operatively linked to an expression control 
sequence in said recombinant DNA molecule. 

4. The recombinant DNA molecule according 
30 to claim 3, wherein said expression control sequence 

is selected from the group consisting of the early 
or late promoters of SV40 or adenovirus, the lac 
system, the trp_ system, the TAC system, the TRC . 
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system, the major operator and promoter regions of 
phage k, the control regions of fd coat protein, the 
promoter for 3-phosphoglycerate kinase or other 
glycolytic enzymes, the promoters of acid phosphatase, 
5 the polyhedron promoter of the baculovirus system 
and the promoters of the yeast ct -mating factors. 

5. A unicellular host transformed with a 
recombinant DNA molecule selected from the group 
consisting of the recombinant DNA molecules of claim 3 
X0 or 4. 

6- The host according to claim 5, wherein 
said host is selected from the group consisting of 
strains of E.coli , Pseudomonas , Bacillus , 
Streptomyces , fungi, animal cells, plant cells, 
15 insect cells and human cells in tissue culture. 

7. A polypeptide coded for on expression 
by a DNA sequence selected from the group consisting 
of the DNA sequences of claim 1 or 2, said polypep- 
tide being essentially free of other proteins of 

20 human origin. 

8. The polypeptide according to claim 7, 
wherein said polypeptide is selected from the group 
consisting of a polypeptide of the formula AA_ 2 3" AA 362 
of Figure 3, a polypeptide of the formula AA 1 _ 362 of 

25 Figure 3, a polypeptide of the formula Met-AA 1-362 

of Figure 3, a polypeptide of the formula AA 1 _ 3?4 of 
Figure 3, a polypeptide of the formula Met-AA X _ 374 
of Figure 3, a polypeptide of the formula AA 1-377 of 
Figure 3, a polypeptide of the formula Met-AA 1-377 

30 of Figure 3, a polypeptide of the formula AA _23~ AA 374 
of Figure 3, a polypeptide of the formula AA _23"" AA 377 
of Figure 3 . 
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9. The polypeptide according to claim 7, 
wherein said polypeptide is selected from the 
group consisting of a polypeptide of the formula 
AA_ 23 -AA 182 of Figure 16, a polypeptide of the 
formula AA 1 -AA lg2 of Figure 16, a polypeptide of 
the formula Met-AA^^ of Figure 16, a polypeptide 
of the formula AA_ 23 -AA 1Q2 of Figure 16, followed 
by the amino acids asparagine-leucine-glutamine- 
histidine-serine-leucine, a polypeptide of the formula 
AA -AA 1Q2 of Figure 16, followed by the amino acids 
asparagine-leucine-glutamine-histidine-serine-leucine, 

a polypeptide of the formula Met-AA 1 . 182 of Figure 16, 
followed by the amino acids asparagine-leucine- 
glutamine-histidine-serine-leucine, a polypeptide of 
15 the formula AA^-AA^ of Figure 16, a polypeptide 
of the formula AA^AA^ of Figure 16, a polypeptide 
of the formula Met-AA 1-113 of Figure 16, a polypeptide 
of the formula AA^-AA^ of Figure 16, a polypeptide 
of the formula AA^AA^ of Figure 16, a polypeptide 
of the formula Met-AA 1 _ 111 of Figure 16, a polypep- 
tide of the formula AA^-AA^ of Figure 16, a poly- 
peptide of the formula AA-j-AA^ of Figure 15, a 
polypeptide of the formula Met-AA 1-131 of Figure 16, 
a polypeptide of the formula AA_ 23 -AA 145 of Figure 16, 
25 a polypeptide of the formula AA^AA^ of Figure 16, 

a polypeptide of the formula Met-AA 1-145 of Figure 16, 
a polypeptide of the formula AA_ 23 -AA 166 of Figure 16, 
a polypeptide of the formula AAj-AA^ of Figure 16, 
a polypeptide of the formula Met-AA^^ of Figure 16, 
30 or portions thereof. 

10. The polypeptide according to claim 7, 
wherein said polypeptide is selected from the group 
consisting of a polypeptide of the formula AA_ 23 -AA 362 
of mature T4 protein, a polypeptide of the formula 

35 AA 352 of mature T4 protein, a polypeptide of the 

formula Met-AA 1 _ 362 of mature T4 protein, a polypep- 



20 
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tide of the formula AA 1 _ 374 of mature T4 protein, a 
polypeptide of the formula Met-AA 1 _ 3?4 of mature T4 
protein, a polypeptide of the formula AA 1-377 of 
mature T4 protein, a polypeptide of the formula 
5 Met-AA 1-377 of mature T4 protein, a polypeptide of 

the formula AA^-AA^ of mature T4 protein, a poly- 
peptide of the formula kA _ 2 3~ AA 371 of mature T4 pro " 
tein, or portions thereof. 



11. The polypeptide according to claim 7, 
10 wherein said polypeptide is selected from the group 

consisting of a polypeptide of the formula AA _23~ AA i82 
of mature T4 protein, a polypeptide of the formula 
AA 1 -AA 1Q2 of mature T4 protein, a polypeptide of the 
formula Met-AA 1 _ 182 of mature X4 protein, a polypep- 

15 tide of the formula AA_ 23 -AA 1Q2 of mature T4 protein, 
followed by the amino acids asparagine-leucine- 
glutamine-histi dine- serine- leucine, a polypeptide of 
the formula AA 1 -AA 1S2 of mature T4 protein, followed 
by the amino acids asp aragine- leucine- glutamine- 

20 hi stidine- serine -leucine, a polypeptide of the formula 
Met-AA 1-182 of mature T4 protein, followed by the 
amino acids asparagine-leucine-glutamine-histidine- 
serine- leucine, a polypeptide of the formula 
AA_ 23 -AA 113 of mature T4 protein, a polypeptide of 

25 the formula AA^AA^ of mature T4 protein, a polypep- 
tide of the formula Met-AA 1-113 of mature T4 protein, 
a polypeptide of the formula AA _23~ AA 111 0f mature 
T4 protein, a polypeptide of the formula AA^-AAj^ 
of mature T4 protein, a polypeptide of the formula 

30 Met-AA 1 _ 111 of mature T4 protein, a polypeptide of 

the formula AA -23 -AA 131 of mature T4 protein, a poly- 
peptide of the formula AA 1 -AA 131 of mature T4 protein, 
a polypeptide of the formula Met-AA 1->131 of mature 
T4 protein, a polypeptide of the formula AA_ 23 -AA 145 

3 5 of mature T4 protein, a polypeptide of the formula 

AA X -AA 145 of mature T4 protein, a polypeptide of the 
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formula Met-AA 1 _ 145 of mature T4 protein, a polypep- 
tide of the formula AA_ 23 -AA 166 of mature T4 protein, 
a polypeptide of the formula AA^AA^g of mature T4 
protein, a polypeptide of the formula Met-AA 1-166 of 
5 mature T4 protein, or portions thereof. 

12. A method for producing a polypeptide 
selected from the group consisting of the polypeptides 
of any one of claims 7 to 11 comprising the step of 
culturing a unicellular host transformed with a recom- 

10 binant DNA molecule selected from the group consisting 
of the recombinant DNA molecules of claim 3 or 4. 

13. A pharmaceutical composition comprising 
an immunotherapeutic or immunosuppressive effective 
amount of a polypeptide selected from the group con- 

15 sisting of the polypeptides of any one of claims 7 to 
11 and a pharmaceutical ly acceptable carrier. 

14. A method for treating patients com- 
prising the step of treating them in a pharmaceuti- 
cally acceptable manner with a composition selected 

20 from the group consisting of the composition of 
claim 13. 

15. The method according to claim 14, 
wherein the patient is treated by intramuscular 
injection of the composition. 

25 16. A diagnostic composition for detecting 

or for monitoring the course of HIV infection com- 
prising a diagnostic effective amount of a polypeptide 
selected from the group consisting of the polypeptides 
of any one of claims 7 to 11. 



30 



17. A method for detecting or for moni- 
toring the course of HIV infection comprising the 
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step of employing as a diagnostic a composition 
selected from the group consisting of the compositions 
of claim 16. 

18. A means for detecting or for monitoring 
5 the course of HIV infection comprising a composition 

selected from the group consisting of the compositions 
of claim 16. 

19. A pharmaceutical composition compris- 
ing an immunotherapeutic or immunosuppressive effec- 

10 tive amount of antibody to a polypeptide selected 

from the group consisting of the polypeptides of any 
one of claims 7 to 11 and a pharmaceutical ly accept- 
able carrier. 

20. A method for treating patients com- 
15 prising the step of treating them in a pharmaceuti- 

cally acceptable manner with a composition according 
to claim 19. 

21. The use of a polypeptide selected 
from the group consisting of the polypeptides of any 

20 one of claims 7 to 11 to purify HIV virus. 

22. The use according to claim 20, wherein 
the HIV virus is purified from a biological sample. 

23. A method for purifying HIV virus from 
a sample comprising the step of exposing the sample 

25 to a polypeptide selected from the group consisting 
of the polypeptides of any one of claims 7 to 11. 

24. The method according to claim 22, 
wherein the sample is a biological sample.. 
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25. A DNA sequence comprising the DNA 
insert of pl70-2, said sequence coding on expression 
for a T4-like polypeptide. 

26. A recombinant DNA molecule comprising 
5 a DNA sequence selected from the group consisting of 

the DNA sequence of claim 25, said DNA sequence 
being operatively linked to an expression control 
sequence in said recombinant DNA molecule. 

27. A unicellular host transformed with a 
za recombinant DNA molecule according to claim 26. 

28. A polypeptide coded for on expression 
by a DNA sequence of claim 25, said polypeptide being 
essentially free of other proteins of human origin. 

29. A pharmaceutical composition comprising 
15 an immunotherapeutic or immunosuppressive amount of a 

soluble protein receptor and a pharmaceutic ally 
acceptable carrier. 

30. A method for treating patients 
comprising the step of treating them in a pharma- 

20 ceutically acceptable manner with a pharmaceutical 
composition of claim 29. 

31. A diagnostic composition for detecting 
or for monitoring the course of viral infection com- 
prising a diagnostic effective amount of a soluble 

25 protein receptor. 

32. A method for detecting or for moni- 
toring the course of a viral infection comprising 
the step of employing as a diagnostic a diagnostic 
effective amount of a soluble protein receptor. 
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33 . A means for detecting or for moni- 
toring the course of a viral infection comprising 
a soluble protein receptor. 

34. A DNA sequence selected from the group 
consisting of: 

(a) the DNA insert of pBiv.l ; 

(b) DNA sequences which hybridize to 
the DNA insert of pBiv.l and which code on expression 
for a polyvalent soluble T4-like polypeptide; and 

(c) DNA sequences which code on 
expression for a polyvalent soluble T4-like polypep- 
tide coded for by the DNA insert of pBiv.l. 

35. A recombinant DNA molecule comprising 
a DNA sequence selected from the group consisting of 
the DNA sequences of claim 34, said DNA sequence 
being operatively linked to an expression control 
sequence in said recombinant DNA molecule. 

36. A unicellular host transformed with a 
recombinant DNA molecule according to claim 35. 

37. A polypeptide coded for on expression 
by a DNA sequence selected from the group consisting 
of the DNA sequences according to claim 34, said 
polypeptide being essentially free of other proteins 
of human origin. 

38. The polypeptide according to claim 7, 
wherein said polypeptide is polyvalent. 

39. A method for producing a polyvalent 
polypeptide comprising the steps of: 

(a) culturing a unicellular host 
transformed with a recombinant DNA molecule according 
to claim 3 or 4 to produce a polypeptide; and 
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(b) coupling said polypeptide to a 
carrier to form a polyvalent polypeptide. 

40. A DNA sequence comprising: 

(a) a first portion comprising a DNA 
sequence coding for the constant region of an immuno- 
globulin light chain; and 

(b) a second portion comprising a 
DNA sequence according to claim 1 or 2, or portions 
thereof, said second portion being joined upstream 
of said first portion. 

41. A DNA sequence comprising: 

(a) a first portion comprising a DNA 
sequence coding for the constant region of an immuno- 
globulin heavy chain; and 

(b) a second portion comprising a 
DNA sequence according to claim 1 or 2, or portions 
thereof, said second portion being joined upstream 
of said first portion. 

42. An expression vector comprising the 
DNA sequence according to claim 40. 

43. An expression vector comprising the 
DNA sequence according to claim 41. 

44. An expression vector comprising the 
DNA sequence according to claim 40 and the DNA 
sequence according to claim 41. 

45. A method for producing a chimeric 
rsT4/IgG 1 comprising the step of co-trans fecting a 
host cell with the expression vector according to 
claim 42 and the expression vector according to 
claim 43 . 
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46. A method for producing a chimeric 
rsT4/IgG 1 comprising the step of trans fecting a host 
cell with the expression vector according to claim 44. 

47. A chimeric rsT4/IgG 1 produced by the 
method according to claim 45 or 46. 



48. A pharmaceutical composition comprising 
an immunotherapeutic or immunosuppressive effective 
amount of a polypeptide according to claim 37 or 38. 

49. A method for treating patients com- 
prising the step of treating them in a pharmaceuti- 
cally acceptable manner with a composition according 
to claim 48. 



50. A diagnostic composition for detecting 
or for monitoring the course of HIV infection com- 
prising a diagnostic effective amount of a polypeptide 
according to claim 37 or 38. 

51. A pharmaceutical composition comprising 
an immunotherapeutic or immunosuppressive effective 
amount of a chimeric rsT4/IgG 1 according to claim 47. 

52. A method for treating patients com- 
prising the step of treating them in a pharmaceuti- 
cally acceptable manner with a composition according 
to claim 51. 
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BSS 3ES R 

iCC ICC X i HM 

tor tor 1 J ha 

121 121 1 1 11 

saurrxTJUuic^ 3300 



GlyLauZn 



ld ArgTyrGXnAl*»li«»roProfilyS«rS«rL«ttValArgS«r»roValPro 



4 ap i I2 

/ —-——«-- -'•"^GGGXACCGTGGCGCTTTCT 



ACCCTGCCGCTTACCGGXTXCCTGTCCGCCTTrCTCCCTTCC 

J301 -£££f££^ 



3360 



3361 



TGCGACC 

ThrL*uProI^uTnrGirryrUuS«rAiaPh*L«uProS«rtlyS«rvalAiaL«uS«r - 

D . A 

d 1 

: ! 

CAATtXTCXCtXTGTAfiGTXTCTCAGTTCGGTCTACGTCGTTCGCTC CAAGCTGGGCTGT 



3420 



G TTAC GAGTGCGXCATC CATAGAGTCAAGCCACATC CAGCAAfiCGAGCTTC GACC C GACA 
GlnCyaS«rArgCy»AxgTyrL«uS«rSarValEa<iValValArgSarLy«LauGlyCyt 



3421 



i an nr o ** 

l lio v B4 P a ap« J 

1 S3 1 2H 1 1 213 1 

GTGCXCGAACCCCCCGTTCACCCCGA^ 

^ACGWTTG^GGGCAAGTC 
7alHiaGluProProValGlaProAjpArgCytXlaL«uS«rGlyAattTyrArgL«uGlu - 

KMHC 5 S BUB M 

p- cr * u b * b ? 

SS H H 13 1 1 

TCCAACCC^TAAGACACGACTTATCGCCACTGGCAG^ 

3 * a 1 j^3gttgggccattcwtg^tg^\atagcg^ 

S«rAanProValArgHiiA«pL«uS«rPToL«uAlaAlaAlaThrGlyAaaArgIlaS«s - 

R 
a 



3480 



3S40 



3 

:CTAACTACCGCTA 

* 3600 

TCTCGCTCCA1 

ArgAlaArgTyrValGlyGlyAlaThrGluPhaLeuLysTiTsTrpProAanTyTGlyTyT - 



AGAGC GAGGTATGTAGGC GGTGCTACAGAGTTCTTGAAG^G TG<KCTAACTAC GGCTAC 
3541 ^TACATCC GC CAC GATGTCTCAAGAACTTCAC CAC C GGATTGATGC C GATG 
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" 1 2 2 

A a a * 

• p * ; 

1 11 3 

ACTAflAAGGACXGTATTTCCTATCT^^TCTGCTGAA^CAflTTACCTTCCGAAAAAGX 



MCI 



3660 



^TCTTCCTCTCATAAACCATAGACCCGAGACGA^ 

ThrArgArgThrV«lPh*GlyIl«CyiAlaL«ut«uLyiProV*lThrPh«GlyLyiArg - 

S M 

X KHDaHM ■ 

1 bdpupa P 

u o«a3ap 3 

1 121A21 2 



X«l 



3720 



caaccatc gagaactacgccgtttg TTTGGTGGC GACCATCCCCACCXAAAAAACAAACG 
ValGlyS*rS«r2adS«rtlyLy«GlaTHrTlirXlaGlyS*r<Sly<31yPh«Ph«V4lCyi - 

F 7 H S S S 

n BnHiT HNDaJC KHOaJt MNDM* 

u buhiin bdpuh bdpua bdpbu 

4 vO*P» o«a3o oaa3o o«ao3 

H 12X11 121A2 121A2 1212A 

AAcau^uacrcxcGct^xc^^ 3780 

3721 TOCTCGTC^TG^G' lCm - n 

LytGlaGlnIl«TarArgAxgLyiLysGlyS«rGlaGiuAjpProL«uIlePh«SerThr - 

0 8 M lb 

• * • ? 2 

11 2 3 

GGCTCTGACGCTCAGTG<^GAAAAC^ 3840 

3791 ^EaGAC WGA^CCWGCOTGAGTG^ 

GlyS«rAjpAlaalaTrpA*aGluA*aSarArgEadGlyIl«L«uVaiM«tArgLeuSar - 
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S 3 



00 



H KKOaJE M MNDaJC BMOMaJC MHO* A 

p bdpua a Wpuh abpdua bdpu 1 

5 o«a3o ft o«a3o loa«3o o«a3 a 

1 121A2 1 121A2 2112A2 121A 1 

AAAJU3GATCTTCACCTAGATCCTTTTCAGA 3 9 

3841 Stoctagaagtg^tctag^^ 

LyiArgIl«Pa«ThrEadIl«LauPheArgS«rPToAapLeuEadLauS«rTrpPh«Alft - 

* » 2 s 

m h d na 

Pa • 1* 

11 / 
CJUUGCCCATTGCATAATCTTTCACGG^^^ 3950 

3901 G^CGCGTAACGTAWA^ 

GlnSarAlaLauHisAaaLauSerGlyLauCysValValProTyrAsnLauLeuSarThr - 
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4021 



4081 



4141 



NaH « i 

ipp a » 

311 1 2 

ACGTTOTAATAG^GGTCTCCA^ 

CytAjaHitTyrHiiXrgGlnArgEndwaS«r<3UHliXl*XrgCytEndIltPb«IU - 

BR F a 

HM •9^ M n A 

p« pi« n u 1 

12 212 1 H 1 

C CTTGCGCTGATAGATTTAAC G^ATGAGCACAAAAAAGAAAC ^^^f+^^^^T^^* + Q80 
GG^CCCACTATCTAAATTG^ 

ProCytGlyXjpArgPH«A*nV*lEndJU.aGlnLysArgA*aHiiEndKisLy»SarS«r - 

a m k 
b * 9 
v « t 

TTGAGOACG<icTC(^CTTJUUWCAAW ^, i( 

Aj^TCC^cTccxKc^TTrcGrrAAATAcrrri i in 1 1 1 r iacttgaaccgaata 

L«uArgThrHIftV«lAl*L«utytGla*h«H»tLyiLyiXrgLyiM«tAjnL«uAl*Tyr - 
3ES H 

•cc i * 

tor a ° 

121 1 1 

CCCAKUATCTCTCGCAGACAAfiAW^^ 4 2 0 ( 

wtccSagx^gcctctcwtacccctaccccgtcagtccgc^ 
ProAxgA*nL«uS«rGlaTbrArgTrpGlyTrpGlyS«rGlnAlaL«uValL«uTyrLau - 



S f 

fir a 

ft* u 

Hi ♦ 

IX ^ 
ATGGCATCXXTGCXTTJUUlTGCTTATAACCCCGCArrGCTTACAAAAAT^ 

,«-. — --——■*■**•*"■""'"""""* 425< 

TACC GTAGTTACG TAATTTAC GAATATTGCGGCGT AAC GAATGTTTTTAAGAGTTTCAAT 

M«tAlaS«rMfttHi3EndMett«aIl«Thr?ToHisCyaLauGlnLyaPheS«rLy8Leu - 
M 

b 

o 

KGTTGAAGAATTTAGCC CTTCAATCGC CAGAGAAATC ^^^^^3* « « ♦ 4 3 2 
4261 CGCAACTTCCTAAA^CG^JGAAGT^ 

AlftLauLyiAjaLauAIaLftuGlaS«rProGluLyftSftrthrArgCy«M«ttysArgL«u - 
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a M OB R ? 

u a d b • i 

4 • • V * * 

GTATGCAGC CGTCACTTAGAAGTGAGTATGAGTAC C CTGTTTTTTC ^^^^^^^ 43aQ 

^tacctcgccactg^tcttc^^^t^ 

v«lcy«s«rArgHiiL«u(31uValS«rM«tS«rTirL«uPh«Ph«L«uM«tPh«XrgGl!i - 



h on AD f * 

Sdon Id a P 

•*d u« N a 

^wrrcrcACCTAAGCTT^^ 

4381 ESACAAGAG^mGAA^^ 

GlyCy«d«rHi«L«uS«rL«uGluJroL«uProLytValM«tAr<5Xr<^jp<;iyEndAla - 

H M 

i BM * 

a «a 1 
f al 4 

4441 GTTGGTTTTTTCGGTCJ^TAAG^GT CATTAAGGTACTGGC 
GlnProLytLytProValIleL«uHi«S«rtlyL«uArgI.«uLyivaiIleProEadPro - 



H H 
H i A J J 

d u c • 



a d u 2 1 

cjkcaacjig^tccjuigcc^ 45 
gwtotccg^wggtogj^g^wcSa^ 

Hi«GlaGlnAlaProS«rGlaAlaPhaLeuThrGluCyaEadPheSerLauThrLauS«r - 



121 1 L 

k /winv- 1 TT?r rev AT l G£ ^GACTTGGGGGTGATGAGT* TAC C TT Ca 

520 



4521 



ACGCTCTI^GCCAGGTGATTTCTG^ 

TCC GACAACTC GG TC CACT AAAGAC GTATC GGTC TG AAC CCC CACTACTCAAATGG AA6T 

Ar5Leut«uS«rGlaValIlBS«rAlaSadPraAspLeuGlyValM«tS«rLau?roS«r - 

R 
• 

a 

1 

AGAAACTAATTAGGGATAGCGGTCAGGTGTT T^TACAAC^CTA^CCCA^GTACCCAA 
TCTTWCTAATCCCTATCGC^TC 

ArgAjaEadLauGlylleAlaVaUrgCysPhaTyrAanHisEndThrHiaSarTJirGla - 

sussf rruTE sflfiEt 



WO 89/01940 



PCT/US88/02940 



4681 



4741 



39133 



FIG. //(cont'd) 



MW)a M 5 5 
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121A 3 1 3 

TGATCCCATGCAATGAGACraTTCCGTW 4740 

JSA^TACeTTACTCTC^ 

EndS«rHiaXl*M«ttArgv*lv«l»roL«uTrpGlyLyiL«uS«rL«uV*ls«rGlyL«u - 

3 « 

M M MHDa i H A M 

a b bdpu S h 1 5 

2 2 121A 111 Z 



AAGAGACGTTrGCCTGATCCGCAAGC^^ 
mTCTGCAAWCG^TATCCG^ 

LyiArgXrgI^uXl*AjpArgGlnGlyv*lL«uValGlyXl*EadL«ull«Thrll«<;iu 
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ab u« ag« ba pa pbdpun 

£ V 4H 4at vl ap hoaa3o 

ll HI BU U 21 U21A2 

CTTCCTXGAWTAiGCCCCgX^ 

GlaGlaS«rS«rSartlyL«uGla?roTBrMatAxg?roAlaBBdArgIl«s«rHitL«u 
CCJUUUJWTGCCCCCCTGCAAAJU^^ 

'mttottacggcggg^g^^ 

ProA*nAjnAlaProL«uGlaLy«IlaA*nSarTyrLyiLy8Hi«TnrAjpA*nHifLau 



4860 



4920 



a b h 

Hi D K3C 

p B * ppl 

£ e a hlA 

CCGTGATAAACTATCTCTG«GGTCTTGACATA^ 

ArgEadEndriatleSarGlyGlyVaiAspriaAanTnrTarGlyClyXapThrGluHi* - 

H H MM 

o a a a ah 

! 13 3 11 



4930 



4981 



ATCAGCAGGAC GCAC T GAC CAC CATGAAGG TGACGCTCTTaXaATTAAGC C ^ "^^^^ 
TAGTCGTCCWGTG^TGGTG^TACWC^^ 
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?&G368 backbone p j G. 15 

:S a)ub1« T4#3 
; AA #3 = LYS 

bg3Sl.seq Length: 6151 

1 GAATTAATTC CAGCTTGCTG TGGAATGTGT GTCAGTTAGG GTGTGGAAAG 

51 TCCCCAGGCT CCCCAGCAGG CAGAAGTATG CAAAGCATGC ATCTCAATTA 

101 GTCAGCAACC AGGTGTGGAA AGTCCCCAGG CTCCCCAGCA GGCAGAAGTA 

151 TGCAAAGCAT GCATCTCAAT TAGTCAGCAA CCATAGTCCC GCCCCTAACT 

201 CCGCCCATCC CGCCCCTAAC TCCGCCCAGT TCCGCCCATT CTCCGCCCCA 

251 TGGCTGACTA ATTTTTTTTA TTTATGCAGA GGCCGAGGCC GCCTCGGCCT 

301 CTGAGCTATT CCAGAAGTAG TGAGGAGGCT TTTTTGGAGG GGTCCTCCTC 

351 GTATAGAAAC TCGGACCACT CTGAGACGAA GGCTCGCGTC CAGGCCAGCA 

401 CGAAGGAGGC TAAGTGGGAG GGGTAGCGGT CGTTGTCCAC TAGGGGGTCC 

451 ACTCGCTCCA GGGTGTGAAG ACACATGTCG CCCTCTTCGG CATCAAGGAA 

501 GGTGATTGGT TTATAGGTGT AGGCCACQTG ACCGGGTGTT CCTGAAGGGG 

55 1 GGCTATA AAA GGGGGTGGGG GCGCGTTCGT CCTCACTCTC TTCCGCATCG 

- 601 CTGTCTGCGA GGGCCAGCTG TTGGGCTCGC GGTTGAGGAC AAACTCTTCG 

651 CGGTCTTTCC AGTACTCTTG GATCGGAAAC CCGTCGGCCT CCGAACGGTA 

701 CTCCGCCACC GAGGGACCTG AGCGAGTCCG CATCGACCGG ATCGGAAAAC 

751 CTCTCGAGAA AGGCGTCTAA CCAGTCACAG TCGCAAGGTA GGCTGAGCAC 

801 CGTGGCGGGC GGCAGCGGGT GGCGGTCGGG GTTGTTTCTG GCGGAGGTGC 

851 TGCTGATGAT GTAATTAAAG TAGGCGGTCT TGAGACGGCG GATGGTCGAG 

901 GTGAGGTGTG GCAGGCTTGA GATCGATCTG GCCATACACT TGAGTGACAA 

951 TGACATCCAC TTTGCCTTTC TCTCCACAGG TGTCCACTCC CAGGTCCAAC 

1001 TGGATCCAAG CTTCGACTCG AGGAATTCCC CGAAGGAACA AAGCACCCTC 

1051 CCCACTGGGC TCCTGGTTGC AGAGCTCCAA GTCCTCACAC AGATACGCCT 

1101 GTTTGAGAAG CAGCGGGCAA GAAAGACGCA AGCCCAGAGG CCCTGCCATT 

1151 TCTGTGGGCT CAGGTCCCTA CTGGCTCAGG CCCCTGCCTC CCTCGGCAAG 

1201 GCCACAATGA ACCGGGGAGT CCCTTTTAGG CACTTGCTTC TGGTGCTGCA 

1251 ACTGGCGCTC CTCCCAGCAG CCACTCAGGG AAAGAAAGTG GTGCTGGGCA 

1301 AAAAAGGGGA TACAGTGGAA CTGACCTGTA CAGCTTCCCA GAAGAAGAGC 

1351 AT A CAATTCC ACTGGAAAAA CTCCAACCAG ATAAAGATTC TGGGAAATCA 

140 1 GGGCTCCTTC TTAACTA A AG GTCCATCCAA GCTGAATGAT CGCGCTGACT 
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irm F/ G. 15 (cont'd) 

1451 CAAGAAGAAG CTTGTGGGAC CAAGGAAACT TfcCCCTGAT C ATCAAGAAT 

1501 CTTAAGATAG AAGACTCAGA TACTTACATC TGTGAAGTGG AGGACCAGAA 

L551 GGAGGAGGTG CAATTGCTAG TGTTCGGATT GACTGCCAAC TCTGACACCC 

1601 ACCTGCTTCA GGGGCAGAGC CTGACCCTGA CCTTGGAGAG CCCCCCTGGT 

1651 AGTAGCCCCT CAGTGCAATG TAGGAGTCCA AGGGGTAAAA ACATACAGGG 

1701 GGGGAAGACC CTCTCCGTGT CTCAGCTGGA GCTCCAGGAT AGTGGCACCT 

1751 GGACATGCAC TGTCTTGCAG AACCAGAAGA AGGTGGAGTT CAAAATAGAC 

1801 ATCGTGGTGC TAGCTTTCCA GAAGGCCTCC AGCATAGTCT ATAAGAAAGA 

1851 GGGGGAACAG GTGGAGTTCT CCTTCCCACT CGCCTTTACA GTTGAAAAGC 

1901 TGACGGGCAG TGGCGAGCTG TGGTGGCAGG CGGAGAGGGC TTCCTCCTCC 

195 J AAGTCTTGGA TCACCTTTGA CCTGAAGAAC AAGGAAGTGT CTGTAAAACG 

2001 GGTTACCC AG GACCCTAAGC TCCAGATGGG CAAGAAGCTC CCGCTCCACC 

2051 TCACCCTGCC CCAGGCCTTG CCTCAGTATG CTGGCTCTGG AAACCTCACC 

2 101 CTGGCCCTTG AAGCGAAAAC AGGAAAGTTG CATCAGGAAG TGAACCTGGT 

2151 GGTGATGAGA GCCACTCAGC TCCAGAAAAA TTTGACCTGT GAGGTGTGGG 

2201 GACCCACCTC CCCTAAGCTG ATGCTGAGTT TGAAACTGGA GAACAAGGAG 

2251 GCAAAGGTCT CGAAGCGGGA GAAGGCGGTG TGGGTGCTGA ACCCTGAGGC 

2301 GGGGATGTGG CAGTGTCTGC TGAGTGACTC GGGACAGGTC CTGCTGGAAT 

2351 CCAACATCAA GGTTCTGCCC ACATGGTCGA CCCCGGTGCA GCCAATGGCC 

2401 CTGATTTGAG ATCTTTGTGA AGGAACCTTA CTTCTGTGGT GTGACATAAT 

2451 TGGACAAACT ACCTACAGAG ATTTAAAGCT CTAAGGTAAA TATAAAATTT 

2501 TTAAGTGTAT AATGTGTTAA ACTACTGATT CTAATTGTTT GTGTATTTTA 

2551 GATTCCAACC TATGGAACTG ATGAATGGGA GCAGTGGTGG AATGCCTTTA 

2601 ATGAGGAAAA CCTGTTTTGC TCAGAAGAAA TGCCATCTAG TGATGATGAG 

265 1 GCTACTGCTG ACTCTCAACA TTCTACTCCT CCAAAAAAGA AGAGA AAGGT 

2701 AGAAGACCCC AAGGACTTTC CTTCAGAATT GCTAAGTTTT TTGAGTCATG 

2751 CTGTGTTTAG TAATAGAACT CTTGCTTGCT TTGCTATTTA CACCACAAAG 

2801 GAAAAAGCTG CACTGCTATA CAAGAAAATT ATGGAAAAAT ATTCTGTAAC 

2851 CTTTATAAGT AGGCATAACA GTTATAATCA TAACATACTG TTTTTTCTTA 

2901 CTCCACACAG GCATAGAGTG TCTGCTATTA ATAACTATGC TCAAAAATTG 

2951 TGTA CCTTTA GCTTTTTAAT TTGTAAAGGG GTTAATAAGG AATATTTGAT 

3001 GTATAGTGCC TTGACTAGAG ATCATAATCA GCCATACCAC ATTTGTAGAG 

3051 GTTTTACTTG CTTTAAAA A A CCTCCCACAC CTCCQCCTGA ACCTGAAACA 
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FIG.I5(conf'd) 

AATTGTTG TTGTTAACTT GTT1 



H8/?3 

3101 TAAAATGAAT GCAATTGTTG TTGTTAACTT gYttATTGCA GCTTATAATG 
3151 GTTACAAATA AAGCAATAGC ATCACAAATT TCACAAATAA AGCATTTTTT 
3201 TCACTGCATT CTAGTTGTGG TTTGTCCAAA CTCATCAATG TATCTTATCA 
3251 TGTCTGGATC CTCTACGCCG GACGCATCGT GGCCGGCATC ACCGGCGCCA 
3301 CAGGTGCGGT TGCTGGCGCC TATATCGCCG ACATCACCGA TGGGGAAGAT 
3351 CGGGCTCGCC ACTTCGGGCT CATGAGCGCT TGTTTCGGCG TGGGTATGGT 
3401 GGC AGGCCCG TGGCCGGGGG ACTGTTGGGC GCCATCTCCT TGCATGCACC 
3451 ATTCCTTGCG GCGGCGGTGC TCAACGGCCT CAACCTACTA CTGGGCTGCT 
3501 TCCTAATGCA GGAGTCGCAT AAGGGAGAGC GTCGACCGAT GCCCTTGAGA 
3551 GCCTTCAACC CAGTCAGCTC CTTCCGGTGG GCGCGGGGCA TGACTATCGT 
3601 CGCCGCACTT ATGACTGTCT TCTTTATCAT GCAACTCGTA GGACAGGTGC 
365 1 CGGCAGCGCT CTGGGTCATT TTCGGCGAGG ACCGCTTTCG CTGGAGCGCG 
3701 ACGATGATCG GCCTGTCGCT TGCGGTATTC GGAATCTTGC ACGCCCTCGC 
375! TCAAGCCTTC GTCACTGGTC CCGCCACCAA ACGTTTCGGC GAGAAGCAGG 
380 1 CCATTATCGC CGGCATGGCG GCCGACGCGC TGGGCTACGT CTTGCTGGCG 
3851 TTCGCGACGC GAGGCTGGAT GGCCTTCCCC ATTATGATTC TTCTCGCTTC 
3901 CGGCGGCATC GGGATGCCCG CGTTGCAGGC CATGCTGTCC AGGCAGGTAG 
3951 ATGACGACCA TCAGGGACAG CTTCAAGGAT CGCTCGCGGC TCTTACCAGC 
4001 CTAACTTCGA TCACTGGACC GCTGATCGTC ACGGCGATTT ATGCCGCCTC 
4051 GGCGAGCAC A TGGAACGGGT TGGCATGGAT TGTAGGCGCC GCCCTATACC 
4 101 TTGTCTGCCT CCCCGCGTTG CGTCGCGGTG CATGGAGCCG GGCCACCTCG 
4151 ACCTGAATGG AAGCCGGCGG CACCTCGCTA ACGGATTCAC CACTCCAAGA 
4201 ATTGGAGCCA ATCAATTCTT GCGGAGAACT GTGAATGCGC AAACCAACCC 
4251 TTGGCAGAAC ATATCCATCG CGTCCGCCAT CTCCAGCAGC CGCACGCGGC 
4301 GCATCTCGGG CCGCGTTGCT GGCGTTTTTC CATAGGCTCC GCCCCCCTGA 
4351 CGAGCATCAC AAAAATCGAC GCTCAAGTCA GAGGTGGCGA AACCCGACAG 
4401 GACTATAAAG ATACCAGGCG TTTCCCCCTG GAAGCTCCCT CGTGCGCTCT 
4451 CCTGTTCCGA CCCTGCCGCT TACCGGATAC CTGTCCGCCT TTCTCCCTTC 
4501 GGGAAGCGTG GCGCTTTCTC AATGCTCACG CTGTAGGTAT CTCAGTTCGG 
4551 TGTAGGTCGT TCGCTCCAAG CTGGGCTGTG TGCACGAACC CCCCGTTCAG 
4601 CCCGACCGCT GCGCCTTATC CGGTAACTAT CGTCTTGAGT CCAACCCGGT 
4651 AAGACACGAC TTATCGCCAC TGGCAGCAGC CACTGGTAAC AGGATTAGCA 
4701 GAGCGAGGTA TGTAGGCGGT GCTAC AGAGT TCTTGAAGTG GTGGCCTAAC 
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ACCGCGAGAC 


CCACGCTCAC 


CGGCTCCAGA 


5251 


tttatcagca 


ATAAACCAGC 


CAGCCGGAAG 


GGCCGAGCGC 


AGAAGTGGTC 


5301 


CTGCAACTTT 


ATCCGCCTCC 


ATCCAGTCTA 


TTAATTGTTG 


CCGGGAAGCT 


5351 


agagtaagta 


GTTCGCCAGT 


TAATAGTTTG 


CGCAACGTTG 


TTGCCATTGC 


5401 


tgcaggcatc 


GTGGTGTCAC 


GCTCGTCGTT 


TGGTATGGCT 


TCATTCAGCT 


5451 


ccggttccca 


ACGATCAAGG 


CGAGTTACAT 


GATCCCCCAT 


GTTGTGCAAA 


5501 


aaagcggtta 


GCTCCTTCGG 


TCCTCCGATC 


GTTGTCAGAA 


GTAAGTTGGC 


5551 


cgcagtgtta 


TCACTCATGG 


TTATGGCAGC 


ACTGCATAAT 


TCTCTTACTG 


5601 


TCATGCCATC 


CGTAAGATGC 


TTTTCTGTGA 


CTGGTGAGTA 


CTCAACCAAG 


5651 


TCATTCTGAG 


AATAGTGTAT 


GCGGCGACCG 


AGTTGCTCTT 


GCCCGGCGTC 


5701 


AACACGGGAT 


AATACCGCGC 


CACATAGCAG 


AACTTTAAAA 


GTGCTCATCA 


5751 


TTGGA AAACG 


TTCTTCGGGG 


CGAAAACTCT 


CAAGGATCTT 


ACCGCTGTTG 


5801 


AGATCCAGTT 


CGATGTAACC 


CACTCGTGCA 


CCCAACTGAT 


CTTCAGCATC 


5851 


TTTTACTTTC 


ACCAGCGTTT 


CTGGGTGAGC 


AAAAACAGGA 


AGGCAAAATG 


5901 


CCGC AAA A AA 


GGGA ATAAGG 


GCGACACGGA 


AATGTTGAAT 


ACTCATACTC 


5951 


TTCCTTTTTC 


AATATTATTG 


AAGCATTTAT 


CAGGGTTATT 


GTCTCATGAG 


5001 


CGGATACATA 


TTTGAATGTA 


TTTAGA AAAA 


TAAACAAATA 


GGGGTTCCGC 


6051 


GCACATTTCC 


CCGAAAAGTG 


CCACCTGACG 


TCTAAGAAAC 


CATTATTATC 


6101 


ATGACATTAA 


CCTATAAAA A 


TAGGCGTATC 


ACGAGGCCCT 


TTCGTCTTCA 


6151 


A 
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;ffi*a68 backbone iC* / /ZT 

ratable T4#7 f I L7. IO 

r AA #3 = LVS 
:. 152 A A+6AA 
-From 203-5 



bg392.soq Length: 6149 



1 


GAATTAATTC 


cagcttgctg 


1 UUAA IUIUI 


GTCAGTTAGG 


GTGTGGAAAG 


51 


TCCCCAGGCT 




CAGAAGTATG 


CAAAGCATGC 


ATCTCAATTA 


101 


GTCAGCAACC 


Abu I « I viua* 


AGTCCCCAGG 


CTCCCCAGCA 


GGCAGAAGTA 


151 


TGCAAAGCAT 


GCATCTuAA 1 


TAGTCAGCAA 


CCATAGTCCC 


GCCCCTAACT 


201 


CCGCCCATCC 


CGCCCCTAAC 




TCCGCCCATT 


CTCCGCCCCA 


251 


TGGCTGACTA 


ATTTTT T TTA 


TTT ATfir A ft A 


GGCCGAGGCC 


GCCTCGGCCT 


301 


CTGAGCTATT 


LtAuAAu 1 Au 


TftAfiflAGGCT 


TTTTTGGAGG 


GGTCCTCCTC 


351 


GTATAGAAAC 


TCGGACCACT 


PTn a a rr. A A 

L 1 UAuALUAA 


GGCTCGCGTC 

VJ Va# 1 w *J W \J 1 V* 


CAGGCCAGCA 


401 


CGAAGGAGGC 


TAAGTGGGAG 


Ijuj I Auuuu I 




TAGGGGGTCC 


451 


ACTCGCTCCA 


GGGTGTGAAG 


a r* a r* a TfiTrr. 
ALAUA 1 u 1 


rrrTCTTCGG 


CATCAAGGAA 


501 


GGTGATTGGT 


TTATAGGTGT 


AGGCCACviTti 




CCTGAAGGGG 


551 


GGCTATAAAA 


GGGGGTGGGG 


GCGCGTTCGT 


1 LAC ILIL 


TTTCGCATCG 


601 


CTGTCTGCGA 


GGGCCAGCTG 


TTGGGCTCGC 


^r.TTftAr.fiir 

UU I 1 UAUUAL 


AAACTCTTCG 

RMrt W 1 w 1 ' Wwl 


651 


CGG T CTTTCC 


AGTACTCTTG 


GATCGGAAAC 


wUu I Uvjolc < 


TTGAACGGTA 


701 


CTCCGCCACC 


GAGGGACCTG 




CATCGACCGG 


ATCGGAAAAC 


75 1 


CTCTCGAGAA 


AGGCGTCTAA 


CCAGTCACAG 


TCGCAAGGTA 


GGCTGAGCAC 


801 


CGTGGCGGGC 


GGCAGCGGGT 


GGCGGTCGGG 


GTTGTTTCTG 


GCGGAGGTGC 


851 


TGCTGATGAT 


GTAATTAAAG 


t a /■*/■* c nm" p t 
TAuu^uu I v I 


TflAf^A mfirfi 

1 UAUKkfUUUU 


GATGGTCGAG 

win I w ■ v«nw 


901 


GTGAGGTGTG 


GCAGGCTTGA 


GATCGATCTG 


GCCATACACT 


TGAGTGACAA 


951 


TGACATCCAC 


TTTGCCTTTC 


TCTCCACAGG 


TGTCCACTCC 


CAGGTCCAAC 


1001 


TGGATCCAAG 


CTTCGACTCG 


AGGAATTCCC 


CGAAGGAACA 


AAGCACCCTC 


1051 


CCCACTGGGC 


TCCTGGTTGC 


AGAGCTCCAA 


GTCCTCACAC 


AGATACGCCT 


1101 


GTTTGAGAAG 


CAGCGGGCAA 


GAAAGACGCA 


AGCCCAGAGG 


CCCTGCCATT 


1151 


TCTGTGGGCT 


CAGGTCCCTA 


CTGGCTCAGG 


CCCCTGCCTC 


CCTCGGCAAG 


1201 


McT 

gccaca(atg> accggggagt 

tAA-23 

ACTGGCGCTC ctcccagcag 


CCCTTTTAGG 


CACTTGCTTC 


TGGTGCTGCA 


1251 


ccact|cag]gg 
ctgacctgta 


AAAGAAAGTG 


GTGCTGGGCA 


1301 


aaaaagggga 


TACAGTGGAA 


CAGCTTCCCA 


GAAGAAGAGC 


1351 


atacaattcc 


ACTGGAAAAA 


CTCCAACCAG 


ATAAAGATTC 


TGGGAAATCA 
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5i/93 F/G /6{cont'd) 

1401 GGGCTCCTTC TTAACTAAAG GTCCA+CCAA GCTGAATGAT CGCGCTGACT 

1451 CAAGAAGAAG CTTGTGGGAC CAAGGAAACT TTCCCCTGAT CATCAAGAAT 

1501 CTTAAGATAG AAGACTCAGA TACTTACATC TGTGAAGTGG AGGACCAGAA 

1551 GGAGGAGGTG CAATTGCTAG TGTTCGGATT GACTGCCAAC TCTGACACCC 

1601 ACCTGCTTCA GGGGCAGAGC CTGACCCTGA CCTTGGAGAG CCCCCCTGGT 

1651 AGTAGCCCCT CAGTGCAATG TAGGAGTCCA AGGGGTAAAA ACATACAGGG 

1701 GGGGAAGACC CTCTCCGTGT CTCAGCTGGA GCTCCAGGAT AGTGGCACCT 

1751 GGACATGCAC TGTCTTGCAG AACCAGAAGA AGGTGGAGTT^CAAAATAGAC 

1801 ATCGTGGTGC TAGCTTTCCA GAACCTCCAG CATAGTCTA|T_AA/GAAAGAGG 

1B51 GGGAACAGGT GGAGTTCTCC TTCCCACTCG CCTTTACAGT TGAAAAGCTG 

1901 ACGGGCAGTG GCGAGCTGTG GTGGCAGGCG GAGAGGGCTT CCTCCTCCAA 

1951 GTCTTGGATC ACCTTTGACC TGAAGAACAA GGAAGTGTCT GTAAAACGGG 

2001 TTACCCAGGA CCCTAAGCTC CAGATGGGCA AGAAGCTCCC GCTCCACCTC 

2051 ACCCTGCCCC AGGCCTTGCC TCAGTATGCT GGCTCTGGAA ACCTCACCCT 

2101 GGCCCTTGAA GCGAAAACAG GAAAGTTGCA TCAGGAAGTG AACCTGGTGG 

2151 TGATGAGAGC CACTCAGCTC CAGAAAAATT TGACCTGTGA GGTGTGGGGA 

2201 CCCACCTCCC CTAAGCTGAT GCTGAGTTTG AAACTGGAGA ACAAGGAGGC 

2251 AAAGGTCTCG AAGCGGGAGA AGGCGGTGTG GGTGCTGAAC CCTGAGGCGG 

2301 GGATGTGGCA GTGTCTGCTG AGTGACTCGG GACAGGTCCT GCTGGAATCC 

2351 AACATCAAGG TTCTGCCCAC ATGGTCGACC CCGGTGCAGC CAATGGCCCT 

2401 GATTTGAGAT CTTTGTGAAG GAACCTTACT TCTGTGGTGT GACATAATTG 

2451 GACAAACTAC CTACAGAGAT TTAAAGCTCT AAGGTAAATA TAAAATTTTT 

2501 AAGTGTATAA TGTGTTAAAC TACTGATTCT AATTGTTTGT GTATTTTAGA 

2551 TTCCAACCTA TGGAACTGAT GAATGGGAGC AGTGGTGGAA TGCCTTTAAT 

2601 GAGGAAAACC TGTTTTGCTC AGAAGAAATG CCATCTAGTG ATGATGAGGC 

2651 TACTGCTGAC TCTCAACATT CTACTCCTCC AAAAAAGAAG AGAAAGGTAG 

2701 AAGACCCCAA GGACTTTCCT TCAGAATTGC TAAGTTTTTT GAGTCATGCT 

2751 GTGTTTAGTA ATAGAACTCT TGCTTGCTTT GCTATTTACA CCACAAAGGA 

2801 AAAAGCTGCA CTGCTATACA AGAAAATTAT GGAAAAATAT TCTGTAACCT 

2851 TTATAAGTAG GCATAACAGT TATAATCATA ACATACTGTT TTTTCTTACT 

2901 CCACACAGGC ATAGAGTGTC TGCTATTAAT AACTATGCTC AAAAATTGTG 

2951 TACCTTTAGC TTTTTAATTT GTAAAGGGGT TAATAAGGAA TATTTGATGT 

3001 ATAGTGCCTT GACTAGAGAT CATAATCAGC CATACCACAT TTGTAGAGGT 



WO 89/01940 



PCT/US88/02940 
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3051 TTTACTTGCT TTAAAAAACC Tc'cCACACCT CCCCCTGAAC CTGAAACATA 
3101 AAATGAATGC AATTGTTGTT GTTAACTTGT TTATTGCAGC TTATAATGGT 
3151 TACAAATAAA GCAATAGCAT CACAAATTTC ACAAATAAAG CATTTTTTTC 
3201 ACTGCATTCT AGTTGTGGTT TGTCCAAACT CATCAATGTA TCTTATCATG 
3251 TCTGGATCCT CTACGCCGGA CGCATCGTGG CCGGCATCAC CGGCGCCACA 
3301 GGTGCGGTTG CTGGCGCCTA TATCGCCGAC ATCACCGATG GGGAAGATCG 
3351 GGCTCGCCAC TTCGGGCTCA TGAGCGCTTG TTTCGGCGTG GGTATGGTGG 
3401 CAGGCCCGTG GCCGGGGGAC TGTTGGGCGC CATCTCCTTG CATGCACCAT 
3451 TCCTTGCGGC GGCGGTGCTC AACGGCCTCA ACCTACTACT GGGCTGCTTC 
3501 CTAATGCAGG AGTCGCATAA GGGAGAGCGT CGACCGATGC CCTTGAGAGC 
3551 CTTCAACCCA GTCAGCTCCT TCCGGTGGGC GCGGGGCATG ACTATCGTCG 
3601 CCGCACTTAT GACTGTCTTC TTTATCATGC AACTCGTAGG ACAGGTGCCG 
3651 GCAGCGCTCT GGGTCATTTT CGGCGAGGAC CGCTTTCGCT GGAGCGCGAC 
3701 GATGATCGGC CTGTCGCTTG CGGTATTCGG AATCTTGCAC GCCCTCGCTC 
3751 AAGCCTTCGT CACTGGTCCC GCCACCAAAC GTTTCGGCGA GAAGCAGGCC 
3801 ATTATCGCCG GCATGGCGGC CGACGCGCTG GGCTACGTCT TGCTGGCGTT 
3851 CGCGACGCGA GGCTGGATGG CCTTCCCCAT TATGATTCTT CTCGCTTCCG 
3901 GCGGCATCGG GATGCCCGCG TTGCAGGCCA TGCTGTCCAG GCAGGTAGAT 
3951 GACGACCATC AGGGACAGCT TCAAGGATCG CTCGCGGCTC TTACCAGCCT 
4001 AACTTCGATC ACTGGACCGC TGATCGTCAC GGCGATTTAT GCCGCCTCGG 
4051 CGAGCACATG GAACGGGTTG GCATGGATTG TAGGCGCCGC CCTATACCTT 
4101 GTCTGCCTCC CCGCGTTGCG TCGCGGTGCA TGGAGCCGGG CCACCTCGAC 
4151 CTGAATGGAA GCCGGCGGCA CCTCGCTAAC GGATTCACCA CTCCAAGAAT 
4201 TGGAGCCAAT CAATTCTTGC GGAGAACTGT GAATGCGCAA ACCAACCCTT 
4251 GGCAGAACAT ATCCATCGCG TCCGCCATCT CCAGCAGCCG CACGCGGCGC 
4301 ATCTCGGGCC GCGTTGCTGG CGTTTTTCCA TAGGCTCCGC CCCCCTGACG 
4351 AGCATCACAA AAATCGACGC TCAAGTCAGA GGTGGCGAAA CCCGACAGGA 
4401 CTATAAAGAT ACCAGGCGTT TCCCCCTGGA AGCTCCCTCG TGCGCTCTCC 
4451 TGTTCCGACC CTGCCGCTTA CCGGATACCT GTCCGCCTTT CTCCCTTCGG 
4501 GAAGCGTGGC GCTTTCTCAA TGCTCACGCT GTAGGTATCT CAGTTCGGTG 
4551 TAGGTCGTTC GCTCCAAGCT GGGCTGTGTG CACGAACCCC CCGTTCAGCC 
4601 CGACCGCTGC GCCTTATCCG GTAACTATCG TCTTGAGTCC AACCCGGTAA 
4651 GACACGACTT ATCGCCACTG GCAGCAGCCA CTGGTAACAG GATTAGCAGA 

^L-Li^TjT'JTE SHEET 
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FIG. 16 (cont'd) 

4701 GCGAGGTATG TAGGCGGTGC TACAGAGTTC TTGAAGTGGT GGCCTAACTA 

4751 CGGCTACACT AGAAGGACAG TATTTGGTAT CTGCGCTCTG CTGAAGCCAC 

4801 TTACCTTCGG AAAAAGAGTT GGTAGCTCTT GATCCGGCAA ACAAACCACC 

4851 GCTGGTAGCG GTGGTTTTTT TGTTTGCAAG CAGCAGATTA CGCGCAGAAA 

4901 AAAAGGATCT CAAGAAGATC CTTTGATCTT TTCTACGGGG TCTGACGCTC 

4951 AGTGGAACGA AAACTCACGT TAAGGGATTT TGGTCATGAG ATTATCAAAA 

5001 AGGATCTTCA CCTAGATCCT TTTAAATTAA AAATGAAGTT TTAAATCAAT 

5051 CTAAAGTATA TATGAGTAAA CTTGGTCTGA CAGTTACCAA TGCTTAATCA 

5101 GTGAGGCACC TATCTCAGCG ATCTGTCTAT TTCGTTCATC CATAGTTGCC 

5151 TGACTCCCCG TCGTGTAGAT AACTACGATA CGGGAGGGCT TACCATCTGG 

5201 CCCCAGTGCT GCAATGATAC CGCGAGACCC ACGCTCACCG GCTCCAGATT 

5251 TATCAGCAAT AAACCAGCCA GCCGGAAGGG CCGAGCGCAG AAGTGGTCCT 

5301 GCAACTTTAT CCGCCTCCAT CCAGTCTATT AATTGTTGCC GGGAAGCTAG 

5351 AGTAAGTAGT TCGCCAGTTA ATAGTTTGCG CAACGTTGTT GCCATTGCTG 

5401 CAGGCATCGT GGTGTCACGC TCGTCGTTTG GTATGGCTTC ATTCAGCTCC 

5451 GGTTCCCAAC GATCAAGGCG AGTTACATGA TCCCCCATGT TGTGCAAAAA 

5501 AGCGGTTAGC TCCTTCGGTC CTCCGATCGT TGTCAGAAGT AAGTTGGCCG 

5551 CAGTGTTATC ACTCATGGTT ATGGCAGCAC TGCATAATTC TCTTACTGTC 

5601 ATGCCATCCG TAAGATGCTT TTCTGTGACT GGTGAGTACT CAACCAAGTC 

5651 ATTCTGAGAA TAGTGTATGC GGCGACCGAG TTGCTCTTGC CCGGCGTCAA 
5701. CACGGGATAA TACCGCGCCA CATAGCAGAA CTTTAAAAGT GCTCATCATT 
5751 GGAAAACGTT CTTCGGGGCG AAAACTCTCA AGGATCTTAC CGCTGTTGAG 
5801 ATCCAGTTCG ATGTAACCCA CTCGTGCACC CAACTGATCT TCAGCATCTT 
5851 TTACTTTCAC CAGCGTTTCT GGGTGAGCAA AAACAGGAAG GCAAAATGCC 
5901 GCAAAAAAGG GAATAAGGGC GACACGGAAA TGTTGAATAC TCATACTCTT 
5951 CCTTTTTCAA TATTATTGAA GCATTTATCA GGGTTATTGT CTCATGAGCG 
6001 GATACATATT TGA ATGTATT TAGAAAAATA AACAAATAGG GGTTCCGCGC 
6051 ACATTTCCCC GAAAAGTGCC ACCTGACGTC TAAGAAACCA TTATTATCAT 
6101 GACATTAACC TATAAAAATA GGCGTATCAC GAGGCCCTTT CGTCTTCAA 
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5* CTA CCT TTT CCA CTC A 3 

FIG. 18 

5' CAT CTC ACT CCA AAG 3 ' 

63 

5* GGG CTC ATA CTA A 3* 

*± 

5' CAT CTT ACT ATC A 3 ' 

67 

5* GCC CCA CAC CCT CAC CCT CAC CTT CCA GAC CCC C 3' 

63 

S* CCC CCC CCC TCT CCA ACC TCA CCC TCA CCC TCT C 3 ' 

69 

3* CCC CCT ACT ACC CCC TCA CTC CAA TCA 3' 

70 

S' CAT CTC ATT CCA CTC ACC CCC TAC TAC 3' 

21 

3' CCS CCT ACT ACC CCC TCA CTC CAA TCT ACC ACT C 3 ' 

22 

3» TAC CAC TCC TAC ATT CCA CTC ACC CCC TAC TAC 3' 

22 

3' CTA CCC CTA AAA ACA TAC ACC CCC CCA ACA CCT GA 3' 
5' GAT CTC ACC TCT TTC CCC CCC TCT ATC TTT TTA CCC 3 

"S 1 CCA GCA TAC TCC CAC CTC GAC ATC CAC TCT CTT CCA 
CAA CTC A 3 * 

21 

5' CAT CTC ACT TCT GCA ACA CAC TCC ATC TCC AGC .G*_ 
CAC TAT CCT GCA CCT 3' 
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^368 backDone p J Q ^ 

so l uD 1 e T4#9 
A A 03 - LVS 
first 113 A A of T4 

Das ica 1 l y up to vul 

Cig394.seq Length; 5365 

! GAATTAATTC CAGCTTGCTG TGGAATGTGT GTCAGTTAGG GTGTGGAAAG 

5 1 TCCCCAGGCT CCCCAGCAGG CAGAAGTATG CAAAGCATGC ATCTCAATTA 

10 1 GTCAGCAACC AGGTGTGGAA AGTCCCCAGG CTCCCCAGCA GGCAGAAGTA 

151 TGCAAAGCAT GCATCTCAAT TAGTCAGCAA CCATAGTCCC GCCCCTAACT 

201 CCGCCCATCC CGCCCCTAAC TCCGCCCAGT TCCGCCCATT CTCCGCCCCA 

25 1 TGGCTGACTA ATTTTTTTTA TTTATGCAGA GGCCGAGGCC GCCTCGGCCT 

301 CTGAGCTATT CC AGAAGTAG TGAGGAGGCT TTTTTGGAGG GGTCCTCCTC 

35 1 GTATAGAAAC TCGGACC ACT CTGAGACGAA GGCTCGCGTC CAGGCCAGCA 

40 1 CGA A GGAGGC TAAGTGGGAG GGGTAGCGGT CGTTGTCCAC TAGGGGGTCC 

45 1 ACTCGCTCCA GGGTGTGAAG ACACATGTCG CCCTCTTCGG C ATC A AGGAA 

50 1 GGTGATTGGT TTATAGGTGT AGGCCACGTG A CCGGGTGTT CCTGAAGGGG 

551 GGCTATAAAA GGGGGTGGGG GCGCGTTCGT CCTCACTCTC TTCCGCATCG 

60 1 CTGTCTGCGA GGGCCAGCTG TTGGGCTCGC GGTTGAGGAC AAACTCTTCG 

65 1 CGGTCTTTCC AGTACTCTTG GATCGGAAAC CCGTCGGCCT CCGAACGGTA 

701 CTCCGCCACC GAGGGACCTG AGCGAGTCCG CATCGACCGG ATCGGAAAAC 

751 C1CTCGAGAA AGGCGTCTAA CCAGTCACAG TCGCAAGGTA GGCTGAGCAC 

601 CGTGGCGGGC GGCAGCGGGT GGCGGTCGGG GTTGTTTCTG GCGGAGGTGC 

35 1 TGCTGATGAT GTAATTAAAG TAGGCGGTCT TGAGACGGCG GATGGTCGAG 

901 GTGAGGTGTG GCAGGCTTGA GATCGATCTG GCCATACACT TGAGTGACAA 
951 TGACATCCAC TTTGCCTTTC TCTCCACAGG TGTCCACTCC CAGGTCCAAC 
1001 TGGATCCAAG CTTCGAC TCG AGGAATTCCC CGAAGGA AC A AAGCACCCTC 
1051 CCCACTGGGC TCCTGGTTGC AGAGCTCCAA GTCCTCACAC AGATACGCCT 
1101 GTTTGAGAAG CAGCGGGCA A GAA AGACGC A AGCCC AGAGG CCCTGCCATT 
U51 TCTGTGGGCT CAGGTCCCTA CTGGCTCAGG CCCCTGCCTC CCTCGGCAAG 
1201- GCCACAATGA ACCGGGGAGT CCCTTTTAGG CACTTGCTTC TGGTGCTGCA 
125 1 ACTGGCGCTC CTCCCAGCAG CCACTCAGGG AAAGAAAGTG GTGCTGGGCA 
130 1 a A A A AGGGGA TACAGTGGAA ctgacctgta CAGCTTCCCA GAAGAAGAGC 
1351 ATACAATTCC ACTGGAAAAA CTCCAACCAG ATAAAGATTC TGGGAAATCA 
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FIG. 19 (cont'd) 



1401 


GGGCTCCTTC 


TTAACTAAAG 


GTCCATCCAA 


GCTGAATGAT 


CGCGCTGACT 


1451 


CAAGAAGAAG 


CTTGTGGGAC 


CAAGGAAACT 


TTCCCCTGAT 


CATCAAGAAT 


1501 


CTTAAGATAG 


AAGACTCAGA 


TACTTACATC 


TGTGAAGTGG 


AGGACCAGAA 


1551 


GGAGGA GGTG 


CA ATTGCTAG 


TGTTCGGATT 


GACTGCCAAC 


TCTuACACLL 


1601 


ACCTGCTTCA 


GGGGTGATAG 


TAAGATCTTT 


GTGAAGGAAC 


CTTACTTCTG 


1651 


TGGTGTGACA 


TAATTGGACA 


AACTACCTAC 


AGAGATTTAA 


AGCTCTAAGG 


1701 


TAAATATAAA 


ATTTTTAAGT 


GTATAATGTG 


TTAAACTACT 


GATTCTAATT 


1751 


gtttgtgtat 


TTTAGATTCC 


AACCTATGGA 


ACTGATGAAT 


GGGAGCAGTG 


1801 


GTGGA ATGCC 


TTTAATGAGG 


AAAACCTGTT 


TTGCTCAGAA 


GAAATGCCAT 


185 1 


CTAGTGATGA 


TGAGGCTACT 


GCTGACTCTC 


A ACATTCTAC 


TCCTCCAAAA 


190 1 


AAGAAGAGAA 


AGGTAGAAGA 


CCCCAAGGAC 


TTTCCTTCAG 


AATTGCTAAG 


1951 


TTTTTTGAGT 


CATGCTGTGT 


TTAGTAATAG 


AACTCTTGCT 


TGCTTTGCTA 


2001 


TTTACACCAC 


AAAGGAAAAA 


GCTGCACTGC 


TATACAAGAA 


A ATT ATGGAA 


205 1 


A A AT ATTCTG 


TAACCTTTAT 


AAGTAGGCAT 


AACAGTTATA 


ATCATAACAT 


2 1U1 


A C TGTTTTTT 


CTTACTCCAC 


ACAGGCATAG 


AGTGTCTGCT 


ATTAATAACT 


2 151 


ATGCTC AAA A 


ATTGTGTACC 


TTTAGCTTTT 


TAATTTGTAA 


AGGGGTTAAT 


2201 


A AGGA ATATT 


TGATGTATAG 


TGCCTTGACT 


AGAGATCATA 


ATCAGCCATA 


2 251 


CCACATTTGT 


AGAGGTTTTA 


CTTGCTTTAA 


AAAACCTCCC 


ACACCTCCCC 


7 30 1 


CTGAACCTGA 


AACATAAA AT 


GAATGCAATT 


GTTGTTGTTA 


ACTTGTTTAT 


2351 


T GC AGCTT AT 


AATGGTTACA 


AATAAAGCAA 


TAGCATCACA 


AATTTCACAA 


240 1 


ATAAAGCATT 


TTTTTCACTG 


CATTCTAGTT 


GTGGTTTGTC 


CAAACTCATC 


245 L 


AATGTATCTT 


ATC ATGTCTG 


GATCCTCTAC 


GCCGGACGCA 


TCGTGGCCGG 


250 I 


CATCACCGGC 


GCCACAGGTG 


CGGTTGCTGG 


CGCCTATATC 


GCCGACATCA 


2551 


CCGATGGGGA 


AGATCGGGCT 


CGCCACTTCG 


GGCTCATGAG 


CGCTTGTTTC 


2601 


GGCGTGGGTA 


TGGTGGCAGG 


CCCGTGGCCG 


GGGGACTGTT 


GGGCGCCATC 


2651 


TCCTTGC ATG 


CACCATTCCT 


TGCGGCGGCG 


GTGCTCAACG 


GCCTCAACCT 


2701 


ACTA C TGGGC 


TGCTTCCTAA 


TGCAGGAGTC 


GCATAAGGGA 


GAGCGTCGAC 


2751 


CGATGCCCTT 


GAGAGCCTTC 


AACCC AGTC A 


GCTCCTTCCG 


GTGGGCGCGG 


2801 


GGCATGACTA 


TCGTCGCCGC 


ACTTATGACT 


GTCTTCTTTA 


TCATGCAACT 


2B51 


CGTAGGACAG 


GTGCCGGCAG 


CGCTCTGGGT 


CATTTTCGGC 


GAGGACCGCT 


2901 


TTCGCTGGAG 


CGCGACGATG 


ATCGGCCTGT 


CGCTTGCGGT 


ATTCGGAATC 


295 1 


TTGCACGCCC 


TCGCTCAAGC 


CTTCGTCACT 


GGTCCCGCCA 


CCAAACGTTT 


juu : 


CGGCGAGA AG 


CAGGCCATTA 


TCGCCGGCAT 


GGCGGCCGAC 


GCGCTGGGCT 
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sa/?3 FIG. 19 (cont'd) 



JUS j- 


ATGTCTTGCT 


GGCGTTCGCG 


ACGCGAGGCT 


GGATGGCCTT 


CCCCATTATG 


1 1 A 1 

jIUi. 


ATTCTTCTCG 


CTTCCGGCGG 


CATCGGGATG 


CCCGCGTTGC 


AGGCCATGCT 


3151 


n t r r a f^r» r a r» 


GT AGATGACG 


ACCATCAGGG 


ACAGCTTCAA 


GGATCGCTCG 


•a •) n i 


rri r, nrTTAf 


CAGCCTAACT 


TCGATCACTG 


GACCGCTGAT 


CGTCACGGCG 


3251 


Ml 1 1 H 1 UL<-U 


CCTCGGCGAG 


CACATGGAAC 


GGGTTGGCAT 


GGATTGTAGG 


330 1 


LuLlulUL i A 


TACCTTGTCT 


GCCTCCCCGC 


GTTGCGTCGC 


GGTGCATGGA 


3351 




fTCGACCTGA 


ATGGAAGCCG 


GCGGCACCTC 


GCTAACGGAT 


340 1 


T L A L U A L 1 


HMwlMrA 1 i y w 


GCCAATCAAT 


TCTTGCGGAG 


AACTGTGAAT 


/(45 1 


GLGCAAALLA 


ArrrTTf^nrA 

MV»ww 1 1 U>wtw** 


GAACATATCC 


ATCGCGTCCG 


CCATCTCCAG 


3501 


r A/~r'/~f~PAr , n 




CGGGCCGCGT 


TGCTGGCGTT 


TTTCCATAGG 


355 1 


L i IluLLLLL 


TTGATGAGCA 


TCACAAAAAT 


CGACGCTCAA 


GTCAGAGGTG 


360 1 


r f r k \ a r r r n. 


ATAGGACTAT 


AAAGATACCA 


GGCGTTTCCC 


CCTGGAAGCT 


O C C 1 


rrrTrr.Tr,r("i 

V- ' UU 1 Llv*U 


CTCTCCTGTT 


CCGACCCTGC 


CGCTTACCGG 


ATACCTGTCC 


J/Ul 


rr TTTTfTCC 


CTTCGGGAAG 


CGTGGCGCTT 


TCTCAATGCT 


CACGCTGTAG 


1 "J C 1 

3 7 5 1 




TCGGTGTAGG 


TCGTTCGCTC 


CAAGCTGGGC 


TGTGTGCACG 


Tt li r\ 1 




TTAGCCCGAC 


CGCTGCGCCT 


TATCCGGTAA 


CTATCGTCTT 


38 51 


uAb 1 L L A M Lt- 


TGGTAAGACA 


CGACTTATCG 


CCACTGGCAG 


CAGCCACTGG 


3901 


T A A/" AT.r. 1 TT 


AGCAGAGCGA 

M Ut w *4 V* w U ^ 


GGTATGTAGG 


CGGTGCTACA 


GAGTTCTTGA 


395 1 


Ab T (jlj I (juL L 


1 aMU 1 JUL 


TACACTAGAA 


GGACAGTATT 


TGGTATCTGC 


40G i 


^ T r T A 

GCTCTtjCTGA 


a^rrAr.TTAT 

AultAU 1 1 M w 


CTTCGGAAAA 


AGAGTTGGTA 


GCTCTTGATC 


405 1 


CGGCAAALAA 


ALLALLVJ— I Kt 


GTAGCGGTGG 


TTTTTTTGTT 


TGCAAGCAGC 


4 101 


A ATX A f~ !~ H (~~ 

AGATTALulu 




GGATCTCAAG 


AAGATCCTTT 


GATCTTTTCT 


4151 


AluOuu i t 1 u 




GAACGAAAAC 


TCACGTTAAG 


GGATTTTGGT 


420 1 


LA 1 oAtjA 1 1 A 


TPAAAAAGGA 


TCTTCACCTA 


GATCCTTTTA 


AATTAAA AAT 




GA AGTTTT A A 


ATCA ATCTA A 


AGTATATATG 


AGTAAACTTG 


GTCTGACAGT 


4JU 1 




TA AT C AG TGA 


GGC ACCTATC 


TCAGCGATCT 


GTCTATTTCG 


*4 JD 1 




GTTGCCTGAC 


TCCCCGTCGT 


GTAGATAACT 


ACGATACGGG 


440 1 


AGGoCTT all 


A 1 L 1 LiliL L L L 


a r,T rir t cc a a 


TGATACCGCG 


AGACCCACGC 


4451 


TCACCGGCTC 


CAGATTTATC 


AGC AATAAAC 


CAGCCAGCCG 


GAAGGGCCGA 


4501 


GCGCAGAAGT 


GGTCCTGCAA 


CTTTATCCGC 


CTCCATCCAG 


TCTATTAATT 


4551 


GTTGCCGGGA 


AGCTAGAGTA 


AGTAGTTCGC 


CAGTTAATAG 


TTTGCGCAAC 


4601 


GTTG T TGCCA 


TTGCTGCAGG 


CATCGTGGTG 


TCACGCTCGT 


CGTTTGGTAT 


465 1 


GGCTT C ATTC 


AGCTCCGGTT 


CCCAACGATC 


AAGGCGAGTT 


ACATGATCCC 



SUBSTITUTE SHEET 



WO 89/01940 



59/93 



PCT/US88/02940 



FIG. 19 (cont'd) 



470 1 


CCATGTTGTG 


CAAAAAAGCG 


4751 


AGAAGTAAGT 


TGGCCGCAGT 


4801 


TAATTCTCTT 


ACTGTCATGC 


4851 


AGTACTCAAC 


CAAGTCATTC 


4901 


TCTTGCCCGG 


CGTCAACACG 


4951 


AAAAGTGCTC 


ATCATTGGAA 


5001 


TCTTACCGCT 


GTTGAGATCC 


5051 


TGATCTTCAG 


CATCTTTTAC 


5 101 


AGGAAGGCAA 


AATGCCGCAA 


5151 


GAATACTCAT 


ACTCTTCCTT 


5201 


TATTGTCTCA 


TGAGCGGATA 


5251 


AATAGGGGTT 


CCGCGCACAT 


5301 


AAACCATTAT 


TATCATGACA 


5351 


CCCTTTCGTC 


TTCAA 



GTTAGCTCCT TCGGTCCTCC GATCGTTGTC 
GTTATCACTC ATGGTTATGG CAGCACTGCA 
CATCCGTAAG ATGCTTTTCT GTGA CTGGTG 
TGAGAATAGT GTATGCGGCG ACCGAGTTGC 
GGATAATACC GCGCCACATA GCAGAACTTT 
AACGTTCTTC GGGGCGAAA A CTCTCAAGGA 
AGTTCGATGT AACCCACTCG TGCACCCAAC 
TTTCACCAGC GTTTCTGGGT GAGCAAAAAC 
AAAAGGGAAT AAGGGCGACA CGGAAATGTT 
TTTCAATATT ATTGAAGCAT TTATCAGGGT 
CATATTTGAA TGTATTTAGA AAAATA AACA 
TTCCCCGAAA AGTGCCACCT GACGTCTAAG 
TTAACCTATA AAAATAGGCG TATCACGAGG 
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PBG396 

:BG368 backbone 
rsoiubie T4#12 
: A A #3 = LVS 



<Z>0/93 

FIG. 20 



bg396.seq Length: 5518 

1 GAATTAATTC CAGCTTGCTG TGGAATGTGT GTCAGTTAGG GTGTGGAAAG 
51 TCCCCAGGCT CCCCAGCAGG CAGAAGTATG CAAAGCATGC ATCTCAATTA 
101 GTCAGCAACC AGGTGTGGA A AGTCCCCAGG CTCCCCAGCA GGCAGAAGTA 
151 TGCAAAGCAT GCATCTCAAT TAGTCAGCAA CCATAGTCCC GCCCCTAACT 
201 CCGCCCATCC CGCCCCTAAC TCCGCCCAGT TCCGCCCATT CTCCGCCCCA 
251 TGGCTGACTA ATTTTTTTTA TTTATGCAGA GGCCGAGGCC GCCTCGGCCT 
30 1 CTGAGCTATT CCAGAAGTAG TGAGGAGGCT TTTTTGGAGG GGTCCTCCTC 
351 GTATAGAAAC TCGGACCACT CTGAGACGAA GGCTCGCGTC CAGGCCAGCA 
401 CGAAGGAGGC TAAGTGGGAG GGGTAGCGGT CGTTGTCCAC TAGGGGGTCC 
451 ACTCGCTCCA GGGTGTGAAG ACACATGTCG CCCTCTTCGG C ATCAAGGAA 
501 GGTGATTGGT TTATAGGTGT AGGCCACGTG ACCGGGTGTT CCTGAAGGGG 
551 GGCTATAAAA GGGGGTGGGG GCGCGTTCGT CCTCACTCTC TTCCGCATCG 
601 CTGTCTGCGA GGGCCAGCTG TTGGGCTCGC GGTTGAGGAC AAACTCTTCG 
651 CGGTCTTTCC AGTACTCTTG GATCGGAAAC CCGTCGGCCT CCGAACGGTA 
701 CTCCGCCACC GAGGGACCTG AGCGAGTCCG CATCGACCGG ATCGGAAAAC 
75 1 CTCTCGAGAA AGGCGTCTAA CCAGTCACAG TCGCAAGGTA GGCTGAGCAC 
801 CGTGGCGGGC GGCAGCGGGT GGCGGTCGGG GTTGTTTCTG GCGGAGGTGC 
851 TGCTGATGAT GTAATTAAAG TAGGCGGTCT TGAGACGGCG GATGGTCGAG 
901 GTGAGGTGTG GC AGGCTTGA GATCGATCTG GCCATACACT TGAGTGACAA 
951 TGACATCCAC TTTGCCTTTC TCTCCACAGG TGTCCACTCC CAGGTCCAAC 
1001 TGGATCCAAG CTTCGACTCG AGGAATTCCC CGAAGGAACA AAGCACCCTC 
1051 CCCACTGGGC TCCTGGTTGC AGAGCTCCAA GTCCTCACAC AGATACGCCT 
1101 GTTTGAGAAG C AGCGGGCAA GAA AGACGCA AGCCCAGAGG CCCTGCCATT 
1151 TCTGTGGGCT CAGGTCCCTA CTGGCTCAGG CCCCTGCCTC CCTCGGCAAG 
1201 GCCACAATGA ACCGGGGAGT CCCTTTTAGG CACTTGCTTC TGGTGCTGCA 
1251 ACTGGCGCTC CTCCCAGCAG CCACTCAGGG AAAGAA AGTG GTGCTGGGCA 
1301 AAAAAGGGGA TACAGTGGAA CTGACCTGTA CAGCTTCCCA GAAGA AGAGC 
1351 ATAC AATTCC ACTGGAAAAA CTCCAACCAG ATAAAGATTC TGGGAAATCA 
1401 GGGCTCCTTC TTA ACTAAAG GTCCATCCAA GCTGAATGAT CGCGCTGACT 
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«'/» FIG, 20 (cont'd) 



1451 


CAAGAAGAAG 


CTTGTGGGAC 


CAAGGAAACT 


TTCCCCTGAT 


CATCAAGAAT 


1501 


CTTAAGATAG 


AAGACTCAGA 


TACTTACATC 


TGTGAAGTGG 


AGGACCAGAA 


1551 


GGAGGAGGTG 


CAATTGCTAG 


TGTTCGGATT 


GACTGCCAAC 


TCTGACACCC 


1601 


ACCTGCTTCA 


GGGGCAGAGC 


CTGACCCTGA 


CCTTGGAGAG 


CCCCCCTGGT 


1651 


AGTAGCCCCT 


CAGTGCAATG 


TAGGAGTCCA 


AGGGGTAAAA 


ACATACAGGG 


1701 


GGGGAAGACC 


CTCTCCGTGT 


CTCAGCTGGA 


GCTCCAGGAT 


AGTGGCACCT 


1751 


ggacatgcac 


TGTCTTGCAG 


AACTGAGATC 


TTTGTGAAGG 


A ACCTTACTT 


1801 


CTGTGGTGTG 


ACATAATTGG 


ACAAACTACC 


TACAGAGATT 


TAAAGCTCTA 


185 J 


aggtaaatat 


aaaattttta 


AGTGTATAAT 


GTGTTAAACT 


ACTGATTCTA 


190 1- 


ATTGTTTGTG 


TATTTTAGAT 


TCCAACCTAT 


GGAACTGATG 


AATGGGAGCA 


195 1 


GTGGTGGAAT 


gcctttaatg 


AGGAAAACCT 


GTTTTGCTCA 


GAAGAAATGC 


200 1 


Catctagtga 


TGATGAGGCT 


ACTGCTGACT 


CTCAACATTC 


TACTCCTCCA 


105 1 


AAAA AGAAGA 


GAAAGGTAGA 


AGACCCCAAG 


GACTTTCCTT 


CAGAATTGCT 


2 10 1 


AAGTTTTTTG 


AGTCATGCTG 


TGTTTAGTAA 


TAGAACTCTT 


GCTTGCTTTG 


215 1 


ctatttacac 


CACA A AGGAA 


AAAGCTGCAC 


TGCTATACAA 


GAAAATTATG 


2201 


GAAAAATATT 


CTGTAACCTT 


TATAAGTAGG 


CATAACAGTT 


ATAATCATAA 


2251 


CAT A CTGTTT 


TTTCTTACTC 


CACACAGGCA 


TAGAGTGTCT 


GCTATTAATA 


230 1 


ACT ATGCTC A 


AAAATTGTGT 


ACCTTTAGCT 


TTTTAATTTG 


TAAAGGGGTT 


2351 


A ATAAGGA AT 


ATTTGATGTA 


TAGTGCCTTG 


ACTAGAGATC 


ATAATCAGCC 


2401 


ATACCACATT 


TGTAGAGGTT 


TTACTTGCTT 


TAAAAAACCT 


CCCACACCTC 


2451 


CCCCTGAACC 


TGAAACATAA 


AATGAATGCA 


ATTGTTGTTG 


TTAACTTGTT 


2501 


TATTGCAGCT 


TAT AATGGTT 


ACAAATAAAG 


CAATAGCATC 


ACAAATTTCA 


2551 


CAAATAAAGC 


ATTTTTTTCA 


CTGCATTCTA 


GTTGTGGTTT 


GTCCAAACTC 


2601 


ATCAATGTAT 


CTTATCATGT 


CTGGATCCTC 


TACGCCGGAC 


GCATCGTGGC 


2651 


CGGCATCACC 


GGCGCCACAG 


GTGCGGTTGC 


TGGCGCCTAT 


ATCGCCGACA 


2701 


TCACCGATGG 


GGAAGATCGG 


GCTCGCCACT 


TCGGGCTCAT 


GAGCGCTTGT 


2751 


TTCGGCGTGG 


GTATGGTGGC 


AGGCCCGTGG 


CCGGGGGACT 


GTTGGGCGCC 


280 1 


ATC7CCTTGC 


ATGCACCATT 


CCTTGCGGCG 


GCGGTGCTCA 


ACGGCCTCAA 


2851 


CCTACTACTG 


GGCTGCTTCC 


TAATGCAGGA 


GTCGCATAAG 


GGAGAGCGTC 


2901 


GACCGATGCC 


CTTGAGAGCC 


TTCAACCCAG 


TCAGCTCCTT 


CCGGTGGGCG 


295 1 


CGGGGC ATGA 


CTATCGTCGC 


CGCACTTATG 


ACTGTCTTCT 


TTATCATGCA 


3001 


ACTCGTAGGA 


CAGGTGCCGG 


CAGCGCTCTG 


GGTCATTTTC 


GGCGAGGACC 


305 1 


GC T TT CGC T G 


GAGCGCGACG 


ATGATCGGCC 


TGTCGCTTGC 


GGTATTCGGA 
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62/93 FIG. 20 (cont'd) 

3101 ATCTTGCACG CCCTCGCTCA AGCCTTCGTC ACTGGTCCCG CCACCAAACG 

3151 TTTCGGCGAG AAGCAGGCCA TTATCGCCGG CATGGCGGCC GACGCGCTGG 

3201 GCTACGTCTT GCTGGCGTTC GCGACGCGAG GCTGGATGGC CTTCCCCATT 

3251 ATGATTCTTC TCGCTTCCGG CGGCATCGGG ATGCCCGCGT TGCAGGCCAT 

3301 GCTGTCCAGG CAGGTAGATG ACGA C CATCA GGGACAGCTT CAAGGATCGC 

3351 TCGCGGCTCT TACCAGCCTA ACTTCGATCA CTGGACCGCT GATCGTCACG 

3401 GCGATTTATG CCGCCTCGGC GAGCACATGG AACGGGTTGG CATGGATTGT 

3451 AGGCGCCGCC CTATACCTTG TCTGCCTCCC CGCGTTGCGT CGCGGTGCAT 

3501 GGAGCCGGGC CACCTCGACC TGA ATGGAAG CCGGCGGCAC CTCGCTAACG 

3551 GATTCACCAC TCCAAGA ATT GGAGCCAA'C AATTCTTGCG GAGAACTGTG 

3b01 AATGCGCAAA CCAACCCTTG GCAGAACAT A TCCATCGCGT CCGCCATCTC 

3651 CAGCAGCCGC ACGCGGCGCA t C TCGGGCCG CGTTGCTGGC GTTTTTCCAT 

3701 AGGCTCCGCC CCCCTGACGA GCATCACAAA A ATCGACGCT CAAGTCAGAG 

3751 GTGGCGAAAC CCGACAGGAC TATAAAGATA CCAGGCGTTT CCCCCTGGAA 

3801 GCTCCCTCGT GCGCTCTCCT GTTCCGACCC TGCCGCTTAC CGGATACCTG 

3851 TCCGCCTTTC TCCCTTCGGG AAGCGTGGCG CTTTCTCAAT GCTCACGCTG 

3901 TAGGTATCTC AGTTCGGTGT AGGTCGTTCG CTCCAAGCTG GGCTGTGTGC 

395 1 ACGAACCCCC CGTTCAGCCC GACCGCTGCG CCTTATCCGG TAACTATCGT 

4001 CTTGAGTCCA ACCCGGTAAG ACACGACTTA TCGCCACTGG CAGCAGCCAC 

4051 TGGTAACAGG ATTAGCAGAG CGAGGTATGT AGGCGGTGCT ACAGAGTTCT 

4 101 TGAAGTGGTG GCCTAACTAC GGCTACACTA GAAGGACAGT ATTTGGTATC 

4 151 TGCGCTCTGC TGAAGCCAGT TACCTTCGGA AAAAGAGTTG GTAGCTCTTG 

4201 ATCCGGCAA A CAAACCACCG CTGGTAGCGG TGGTTTTTTT GTTTGCAAGC 

4251 AGC AGATTAC GCGCAGAAAA AAAGGATCTC AAGAAGATCC TTTGATCTTT 

4301 TCTACGGGGT CTGACGCTCA GTGGAACGAA AACTCACGTT AAGGGATTTT 

4351 GGTCATGAGA TTATC AA A A A GGATCTTCAC CTAGATCCTT TT A A ATTAAA 

4401 A ATGAAGTTT TAAATCAATC TAAAGTATAT ATGAGTAAAC TTGGTCTGAC 

4451 AGTTACCAAT GCTTAATCAG TGAGGCACCT ATCTCAGCGA TCTGTCTATT 

4501 TCGTTCATCC ATAGTTGCCT GACTCCCCGT CGTGTAGATA ACTACGATAC 

4551 GGGAGGGCTT ACCATCTGGC CCCAGTGCTG CAATGATACC GCGAGACCCA 

4601 CGCTCACCGG CTCCAGATTT ATCAGCAATA AACCAGCCAG CCGGAAGGGC 

4651 CGAGCGCAGA AGTGGTCCTG CAACTTTATC CGCCTCCATC CAGTCTATTA 

u7C,l ATTGTTGCCG GGAAGCTAGA GTAAGTAGTT CGCCAGTTAA TAGTTTGCGC 
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FIG. 20 (cont'd) 



4751 


AACGTTGTTG 


CCATTGCTGC 


AGGCATCGTG 


GTGTCACGCT 


CGTCGTTTGG 


4B01 


TATGGCTTCA 


TTCAGCTCCG 


GTTCCCAACG 


ATCAAGGCGA 


GTTACATGAT 


4851 


CCCCCATGTT 


GTGC A A A A A A 


GCGGTTAGCT 


CCTTCGGTCC 


TCCGATCGTT 


4901 


GTCAGAAGTA 


AGTTGGCCGC 


AGTGTTATCA 


CTCATGGTTA 


TGGCAGCACT 


4951 


GC AT A ATTCT 


CTTACTGTCA 


TGCCATCCGT 


AAGATGCTTT 


TCTGTGACTG 


5001 


GTGAGTACTC 


AACCAAGTC A 


TTCTGAGAAT 


AGTGTATGCG 


GCGACCGAGT 


5051 


TGCTCTTGCC 


CGGCGTCA AC 


ACGGGATAAT 


ACCGCGCCAC 


ATAGCAGAAC 


5101' 


TTTAAAAGTG 


CTCATCATTG 


GAAAACGTTC 


TTCGGGGCGA 


AAACTCTCAA 


5151 


GGATCTTACC 


GCTGTTGAGA 


TCCAGTTCGA 


TGTAACCCAC 


TCGTGCACCC 


5201 


AACTGATCTT 


CAGCATCTTT 


TACTTTCACC 


AGCGTTTCTG 


GGTGAGCAAA 


5251 


AACAGGAAGG 


CAAAATGCCG 


CAAAAAAGGG 


AATAAGGGCG 


ACACGGAAAT 


530 1 


GTTGAATACT 


CATACTCTTC 


CTTTTTCAAT 


ATTATTGAAG 


CATTTATCAG 


5351 


GGTTATTGTC 


TCATGAGCGG 


ATACATATTT 


GAATGTATTT 


AGAAAAATAA 


5401 


ACAA ATAGGG 


GTTCCGCGCA 


CATTTCCCCG 


AAAAGTGCCA 


CCTGACGTCT 


5451 


AAGA A ACCAT 


TATTATCATG 


ACATTAACCT 


ATAAAAATAG 


GCGTATCACG 


5501 


AGGCCCTTTC 


GTCTTCAA 
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:S&368 backbone f~ I \D . £ I 

rsoluble T4#8 
rA-A <3 = UVS 

:"p«rf act" Stu/first 182 AA of T4 
rbaslcat 1 y up to V2J2 

bg393.seq Length: 5566 

1 GAATTAATTC CAGCTTGCTG TGGAATGTGT GTCAGTTAGG GTGTGGA AAG 

51 TCCCCAGGCT CCCCAGCAGG CAGAAGTATG CAAAGCATGC ATCTCA ATTA 

101 GTCAGCAACC AGGTGTGGAA AGTCCCCAGG CTCCCCAGCA GGCAGAAGTA 

151 TGCAAAGCAT GCATCTCAAT TAGTCAGCAA CCATAGTCCC GCCCCTAACT 

201 CCGCCCATCC CGCCCCTAAC TCCGCCCAGT TCCGCCCATT CTCCGCCCCA 

251 TGGCTGACTA ATTTTTTTTA TTTATGCAGA GGCCGAGGCC GCCTCGGCCT 

301 CTGAGCTATT CCAGAAGTAG TGAGGAGGCT TTTTTGGAGG GGTCCTCCTC 

351 GTATAGAAAC TCGGACCACT CTGAGACGAA GGCTCGCGTC CAGGCCAGCA 

401 CGAAGGAGGC TAAGTGGGAG GGGTAGCGGT CGTTGTCCAC TAGGGGGTCC 

451 ACTCGCTCCA GGGTGTGAAG ACACATGTCG CCCTCTTCGG CATCAAGGA A 

501 GGTGATTGGT TTATAGGTGT AGGCCACGTG ACCGGGTGTT CCTGAAGGGG 

551 GGCTATA AAA GGGGGTGGGG GCGCGTTCGT CCTCACTCTC TTCCGCATCG 

601 CTGTCTGCGA GGGCCAGCTG TTGGGCTCGC GGTTGAGGAC AA ACTCTTCG 

551 CGGTCTTTCC AGTACTCTTG GATCGGAAAC CCGTCGGCCT CCGAACGGTA 

701 CTCCGCCACC GAGGGACCTG AGCGAGTCCG CATCGACCGG ATCGGAAAAC 

751 CTCTCGAGAA AGGCGTCTAA CCAGTCACAG TCGCAAGGTA GGCTGAGCAC 

801 CGTGGCGGGC GGCAGCGGGT GGCGGTCGGG GTTGTTTCTG GCGGAGGTGC 

851 TGCTGATGAT GTAATTAAAG TAGGCGGTCT TGAGACGGCG GATGGTCGAG 

901 GTGAGGTGTG GCAGGCTTGA GATCGATCTG GCCATACACT TGAGTGACAA 

951 TGACATCCAC TTTGCCTTTC TCTCCACAGG TGTCCACTCC CAGGTCCAAC 

1001 TGGATCCAAG CTTCGACTCG AGGA ATTCCC CGAAGGAACA AAGCACCCTC 

1051 CCCACTGGGC TCCTGGTTGC AGAGCTCCAA GTCCTCACAC AGATACGCCT 

1101 GTTTGAGAAG CAGCGGGCA A GAAAGACGCA AGCCCAGAGG CCCTGCCATT 

1151 TCTGTGGGCT CAGGTCCCT A CTGGCTCAGG CCCCTGCCTC CCTCGGCAAG 

1201 GCCACAATGA ACCGGGGAGT CCCTTTTAGG CACTTGCTTC TGGTGCTGCA 

1251 ACTGGCGCTC CTCCCAGCAG CCACTCAGGG AAAGAAAGTG GTGCTGGGCA 

1301 AAAAAGGGGA TACAGTGGAA CTGACCTGTA CAGCTTCCCA GAAGAAGAGC 

1351 ATACAATTCC ACTGGAA A A A CTCCAACCAG ATAAAGATTC TGGGAAATCA 
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FIG. 21 (cont'd) 



1401 


GGGCTCCTTC 


TTAACTAAAG 


GTCCATCCAA 


GCTGAATGAT 


CGCGCTGACT 


145 1 


CAAGA AGAAG 


CTTGTGGGAC 


CAAGGAAACT 


TTCCCCTGAT 


CATCAAGAAT 


1501 


CTTAAGATAG 


AAGACTCAGA 


TACTTACATC 


TGTGAAGTGG 


AGGACCAGAA 


1551 


GGAGGAGGTG 


CAATTGCTAG 


TGTTCGGATT 


GACTGCCAAC 


TCTGACACCC 


1 O W i 


ACCTGCTTCA 


GGGGCAGAGC 


CTGACCCTGA 


CCTTGGAGAG 


CCCCCCTGGT 


1 CC1 
X a <j i- 


AGTAGCCCCT 


CAGTGCAATG 


TAGGAGTCCA 


AGGGGTAAAA 


ACATACAGGG 


i7ni 


GGGGAAGACC 


CTCTCCGTGT 


CTCAGCTGGA 


GCTCCAGGAT 


AGTGGCACCT 


1 7*5 1 
i / j i 


GGACATGCAC 


TGTCTTGCAG 


AACCAGAAGA 


AGGTGGAGTT 


CAAAATAGAC 


l an 1 


ATCGTGGTGC 


TAGCTTTCCA 


GTGAGATCTT 


TGTGAAGGAA 


CCTTACTTCT 


i. D J X 


GTGGTGTGAC 


ATAATTGGAC 


AAACTACCTA 


CAGAGATTTA 


AAGCTCTAAG 


1901 


GTAAATATAA 


AATTTTTAAG 


TGTATAATGT 


GTTAAACTAC 


TGATTCTAAT 


1951 


TGTTTGTGTA 


TTTTAGATTC 


CAACCTATGG 


AACTGATGAA 


TGGGAGCAGT 


200 1 

4. VwJ A 


GGTGGAATGC 


CTTTAATGAG 


GAAAACCTGT 


TTTGCTCAGA 


AGAAATGCCA 


2051 


TCTAGTGATG 


ATGAGGCTAC 


TGCTGACTCT 


CAACATTCTA 


CTCCTCCAAA 


i in l 


AAAGAAGAGA 


AAGGTAGAAG 


ACCCCAAGGA 


CTTTCCTTCA 


GAATTGCTAA 


lIDI 


GTTTTTTGAG 


TCATGCTGTG 


TTTAGTAATA 


GAACTCTTGC 


TTGCTTTGCT 


"> ?n i 

A. £. \J 1 


ATTTAC ACCA 


CA AAGGA AAA 


AGCTGCACTG 


CTATACAAGA 


AAATTATGGA 


^ £ 3 i. 


A AAATATTCT 


GTAACCTTTA 


TAAGTAGGCA 


TAACAGTTAT 


AATCATAACA 


^ O W A 


TACTGTTTTT 


TCTTACTCCA 


CACAGGCATA 


GAGTGTCTGC 


TATTAATAAC 




TATGCTCAAA 


AATTGTGTAC 


CTTTAGCTTT 


TTAATTTGTA 


AAGGGGTTAA 


H U 1 


T A AfiGA ATAT 


TTGATGTATA 


GTGCCTTGAC 


TAGAGATCAT 


AATCAGCCAT 




ACCACATTTG 


TAGAGGTTTT 


ACTTGCTTTA 


AAAAACCTCC 


CACACCTCCC 


250 1 


CCTGAACCTG 


A AACATA AAA 


TGAATGCAAT 


TGTTGTTGTT 


AACTTGTTTA 


255 1 


TTGCAGCTTA 


TAATGGTTAC 


AAATAAAGCA 


ATAGCATCAC 


AAATTTCACA 


2601 


AATAAAGCAT 


TTTTTTCACT 


GCATTCTAGT 


TGTGGTTTGT 


CCAAACTCAT 


265 1 


CAATGTATCT 


TATCATGTCT 


GGATCCTCTA 


CGCCGGACGC 


ATCGTGGCCG 


2701 


GCATCACCGG 


CGCCACAGGT 


GCGGTTGCTG 


GCGCCTATAT 


CGCCGACATC 




AfCGATGGGG 


AAGATCGGGC 


TCGCCACTTC 


GGGCTC ATGA 


GCGCTTGTTT 


2801 


CGGCGTGGGT 


ATGGTGGCAG 


GCCCGTGGCC 


GGGGGACTGT 


TGGGCGCCAT 


2851 


CTCCTTGCAT 


GCACCATTCC 


TTGCGGCGGC 


GGTGCTCAAC 


GGCCTCAACC 


2901 


TACTACTGGG 


CTGCTTCCTA 


ATGCAGGAGT 


CGCATAAGGG 


AGAGCGTCGA 


2951 


CCGATGCCCT 


TGAGAGCCTT 


CAACCCAGTC 


AGCTCCTTCC 


GGTGGGCGCG 


3001 


GGGCATGACT 


ATCGTCGCCG 


CACTTATGAC 


TGTCTTCTTT 


ATCATGCAAC 
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"i™ FIG. 21 (cont'd) 

3051 TCGTAGGACA GGTGCCGGCA GCGCTCTGGG TCATTTTCGG CGAGGACCGC 

3101 TTTCGCTGGA GCGCGACGAT GATCGGCCTG TCGCTTGCGG TATTCGGAAT 

3151 CTTGCACGCC CTCGCTCAAG CCTTCGTCAC TGGTCCCGCC ACCAAACGTT 

3201 TCGGCGAGAA GCAGGCCATT ATCGCCGGCA TGGCGGCCGA CGCGCTGGGC 

325 1 TACGTCTTGC TGGCGTTCGC GACGCGAGGC TGGATGGCCT TCCCCATTAT 

3301 GATTCTTCTC GCTTCCGGCG GCATCGGGAT GCCCGCGTTG CAGGCCATGC 

335 1 TGTCCAGGCA GGTAGATGAC GACCATCAGG GACAGCTTCA AGGATCGCTC 

3401 GCGGCTCTTA CCAGCCTAAC TTCGATCACT GGACCGCTGA TCGTCACGGC 

3451 GATTTATGCC GCCTCGGCGA GCACATGGAA CGGGTTGGCA TGGATTGTAG 

350 1 GCGCCGCCCT ATACCTTGTC TGCCTCCCCG CGTTGCGTCG CGGTGCATGG 

355 1 AGC CGGGCC A CCTCGACCTG AATGGAAGCC GGCGGCACCT CGCTAACGGA 

360 1 TTCACCACTC CAAGAATTGG AGCCAATCAA TTCTTGCGGA GAACTGTGAA 

365 1 TGCGCAAACC AACCCTTGGC AGAACATATC CATCGCGTCC GCCATCTCCA 

370 1 GCAGCCGCAC GCGGCGCATC TCGGGCCGCG TTGCTGGCGT TTTTCCATAG 

3751 GCTCCGCCCC CCTGACGAGC ATCACAAAAA TCGACGCTCA AGTCAGAGGT 

3801 GGCGAAACCC GACAGGACTA TAAAGATACC AGGCGTTTCC CCCTGGAAGC 

3851 TCCCTCGTGC GCTCTCCTGT TCCGACCCTG CCGCTTACCG GATACCTGTC 

3901 CGCCTTTCTC CCTTCGGGAA GCGTGGCGCT TTCTCAATGC TCACGCTGTA 

3951 GGTATCTCAG TTCGGTGTAG GTCGTTCGCT CCAAGCTGGG CTGTGTGCAC 

4001 GAACCCCCCG TTCAGCCCGA CCGCTGCGCC TTATCCGGTA ACTATCGTCT 

4051 TGAGTCCA AC CCGGTAAGAC ACGACTTATC GCCACTGGCA GCAGCCACTG 

4101 GTAACAGGAT TAGCAGAGCG AGGTATGTAG GCGGTGCTAC AGAGTTCTTG 

415 1 AAGTGGTGGC CTAACTACGG CTACACTAGA AGGACAGTAT TTGGTATCTG 

4201 CGCTCTGCTG AAGCCAGTTA CCTTCGGAAA AAGAGTTGGT AGCTCTTGAT 

4251 CCGGCAAACA AACCACCGCT GGTAGCGGTG GTTTTTTTGT TTGCAAGCAG 

4301 CAGATTACGC GCAGAAAA A A AGGATCTC AA GAAGATCCTT TGATCTTTTC 

4351 TACGGGGTCT GACGCTCAGT GGAACGAAAA CTCACGTTAA GGGATTTTGG 

4401 TCATGAGATT ATC AAAAAGG ATCTTCACCT AGATCCTTTT AAATTA A AAA 

4451 TGAAGTTTTA AATCAATCTA AAGTATATAT GAGTAAACTT GGTCTGACAG 

4501 TTACCAATGC TTAATCAGTG AGGCACCTAT CTCAGCGATC TGTCTATTTC 

4551 GTTCATCCAT AGTTGCCTGA CTCCCCGTCG TGTAGATAAC TACGATACGG 

4601 GAGGGCTTAC CATCTGGCCC CAGTGCTGCA ATGATACCGC GAGACCCACG 

465 1 CTCACCGGCT CC AGATTTAT CAGCAATAAA CCAGCCAGCC GGAAGGGCCG 



WO 89/01940 



PCT/US88/02940 



67/93 



FIG. 21 (cont'd) 
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10 ] GTCAGCAACC AGGTGTGGAA AGTCCCCAGG CTCCCCAGCA GGCAGA AGTA 
151 TGCAAAGCAT GCATCTCAA-r TAGTCAGCAA CCATAGTCCC GCCCCTAACT 

:G) CCGCCCATCC cgcccctaac tccgcccagt tccgcccatt ctccgcccca 
25 1 tggctgacta atttttttta tttatgcaga ggccgaggcc gcctcggcct 
30: ctgagctatt ccagaagtag tgaggaggct tttttggagg ggtcctcctc 

351 gtatagaaaC TCGGACCACT CTGAGACGAA GGCTCGCGTC CAGGCCAGCA 

401 CGAAGGAGGC TAAGTGGGAG GGGTAGCGGT CGTTGTCCAC TAGGGGGTCC 

45 1 ACTCGCTCCA GGGTGTGAAG ACACATGTCG CCCTCTTCGG CATCAAGGA A 

501 GGTGATTGGT TTATAGGTGT AGGCCACGTG ACCGGGTGTT CCTGAAGGGG 

55 1 GGCTATAAAA GGGGGTGGGG GCGCGTTCGT CCTCACTCTC TTCCGCATCG 

601 CTGTCTGCGA GGGCCAGCTG TTGGGCTCGC GGTTGAGGAC AAACTCTTCG 

b5 1 CGGTCTTTCC AGTACTCTTG GATCGGAAAC CCGTCGGCCT CCGAACGGTA 

701 CTCCGCCACC GAGGGACCTG AGCGAGTCCG CATCGACCGG ATCGGA AAAC 

751 C T CTCGAGA A AGGCGTCTAA CCAGTCACAG TCGCAAGGTA GGCTGAGCAC 

80 1 CGTGGCGGGC GGCAGCGGGT GGCGGTCGGG GTTGTTTCTG GCGGAGGTGC 

851 TGCTGATGAT GTAATTAAAG TAGGCGGTCT TGAGACGGCG GATGGTCGAG 

901 3TGAGGTGTG GCAGGCTTGA GATCGATCTG GCCATACACT TGAGTGACA A 

951 TGACATCCAC TTTGCCTTTC TCTCCACAGG TGTCCACTCC CAGGTCCamC 

100) TGCATCCAAG CTTCGACTCG AGGA ATTCCC CGAAGGAACA AAGCACCCTC 

105 1 CCCACTGGGC TCCTGGTTGC AGAGCTCCAA GTCCTCACAC AGATACGCCT 

UOt GTTTGAGAAG CAGCGGGCA A GAAAGACGCA AGCCCAGAGG CCCTGCCATT 

115 1 TCTGTGGGCT CAGGTCCCTA CTGGCTCAGG CCCCTGCCTC CCTCGGCAAG 

1201 GCCACAATGA ACCGGGGAGT CCCTTTTAGG CACTTQCTTC TGGTGCTGCA 

1251 ACTGGCGCTC CTCCCAGCAG CCACTCAGGG AAAGAAAGTG GTGCTGGGCA 

130 1 A A A AAGGGGA TACAGTGGAA CTGACCTGTA CAGCTTCCCA GAAGAAGAGC 

135 1 ATACAATTCC A C T GGA AAAA CTCCAACCAG ATA A AGATTC TGGGAA ATCA 

l«Ci GGGCTCCTTC TTAA C TAAAG GTCCATCCAA GCTGAATGAT CGCGCTGACT 
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FIG. 22(cont'd) 
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FIG. 22(cont'd) 
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FIG. 22(cont'd) 
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